7 results found Sort:
- Filter by Primary Language:
- Python (5)
- Jupyter Notebook (1)
- Rust (1)
- +
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Created
2014-09-27
3,647 commits to master branch, last one 3 months ago
the portable Python dataframe library
Created
2015-04-17
9,233 commits to main branch, last one a day ago
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
Created
2018-06-15
691 commits to master branch, last one about a year ago
Lightweight and extensible compatibility layer between dataframe libraries!
Created
2024-02-19
2,371 commits to main branch, last one 19 hours ago
Work with bioinformatic files using Arrow, Polars, and/or DuckDB
Created
2023-04-22
310 commits to main branch, last one 14 days ago
Command-line interface to quickly generate fake CSV and JSON data
Created
2023-05-25
39 commits to main branch, last one 6 months ago
Exploring Chicago crimes dataset with Jupyter notebooks, DuckDB, Malloy and new Panel/PyScript data and dashboard tools.
Created
2022-10-09
243 commits to main branch, last one 2 years ago