23 results found Sort:
- Filter by Primary Language:
- Rust (15)
- Java (2)
- Dockerfile (1)
- Python (1)
- Shell (1)
- Jupyter Notebook (1)
- Go (1)
- +
Apache DataFusion SQL Query Engine
Created
2021-04-17
9,494 commits to main branch, last one 3 hours ago
the portable Python dataframe library
Created
2015-04-17
9,112 commits to main branch, last one 7 hours ago
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Created
2020-12-11
268 commits to main branch, last one 19 days ago
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Created
2021-12-28
1,157 commits to main branch, last one 26 days ago
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Created
2021-06-28
1,084 commits to master branch, last one 6 hours ago
Apache DataFusion Comet Spark Accelerator
Created
2024-01-15
575 commits to main branch, last one a day ago
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
Created
2023-12-21
264 commits to main branch, last one 2 days ago
Analytical database for data-driven Web applications 🪶
Created
2022-07-04
1,441 commits to main branch, last one 2 days ago
DuckDB-powered analytics for Postgres
Created
2024-05-09
88 commits to dev branch, last one 2 days ago
Next-generation decentralized data lakehouse and a multi-party stream processing network
Created
2019-05-19
1,623 commits to master branch, last one a day ago
Rust implementation of Apache Iceberg with integration for Datafusion
Created
2022-12-20
591 commits to main branch, last one 11 days ago
Query and transform data with PRQL
This repository has been archived
(exclude archived)
Created
2022-10-11
140 commits to main branch, last one about a year ago
An opinionated and batteries included DataFusion implementation.
Created
2022-03-18
294 commits to main branch, last one 8 days ago
Java binding to Apache DataFusion
Created
2021-10-12
167 commits to main branch, last one 6 days ago
A lightweight Logging and Tracing observability solution for Rust, built with Apache Arrow, Apache Parquet and Apache DataFusion.
Created
2022-01-03
229 commits to master branch, last one 3 months ago
etl engine 轻量级 跨平台 流批一体ETL引擎 数据抽取-转换-装载 ETL engine lightweight cross platform batch flow integration ETL engine data extraction transformation loading
Created
2022-04-21
239 commits to main branch, last one 20 days ago
S3 as an ObjectStore for DataFusion
Created
2022-01-04
108 commits to main branch, last one 2 years ago
Exon is an OLAP query engine specifically for biology and life science applications.
Created
2023-05-28
886 commits to main branch, last one 4 days ago
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Created
2022-06-19
490 commits to main branch, last one 3 months ago
Awesome list of alternative dataframe libraries in Python.
Created
2021-11-21
13 commits to main branch, last one about a year ago
Community InfluxDB 3.0 "IOx" static builds + containers + Examples for Developers & Integrators. Experiment with low-cost storage, unlimited cardinality and FlightSQL APIs
Created
2023-05-20
188 commits to main branch, last one 5 months ago
Scale to zero Seafowl hosting with Cloud Run
Created
2023-04-22
10 commits to master branch, last one about a year ago
Experimental Elixir bindings for Apache Arrow including Parquet and DataFusion
Created
2021-03-20
62 commits to main branch, last one 3 years ago