27 results found

1.4k
7.0k
apache-2.0
115
Apache DataFusion SQL Query Engine
Created 2021-04-17
10,305 commits to main branch, last one 9 hours ago
627
5.7k
apache-2.0
82
The portable Python dataframe library
Created 2015-04-17
9,539 commits to main branch, last one 5 hours ago
188
3.3k
apache-2.0
42
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Created 2020-12-11
290 commits to main branch, last one 7 days ago
405
2.7k
apache-2.0
254
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Created 2021-12-28
1,188 commits to main branch, last one 14 hours ago
149
1.4k
apache-2.0
23
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
Created 2021-06-28
1,237 commits to master branch, last one 12 hours ago
192
922
apache-2.0
55
Apache DataFusion Comet Spark Accelerator
Created 2024-01-15
730 commits to main branch, last one 5 hours ago
24
704
apache-2.0
10
LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.
Created 2023-12-21
342 commits to main branch, last one a day ago
21
522
postgresql
5
DuckDB-powered data lake analytics from Postgres
This repository has been archived
Created 2024-05-09
128 commits to dev branch, last one 12 days ago
14
481
apache-2.0
9
Analytical database for data-driven Web applications 🪶
Created 2022-07-04
1,448 commits to main branch, last one 2 months ago
13
433
apache-2.0
1
High-performance Rust stream processing engine, providing powerful data stream processing capabilities, supporting multiple input/output sources and processors.
Created 2025-03-01
113 commits to main branch, last one 8 hours ago
14
311
other
16
Next-generation decentralized data lakehouse and a multi-party stream processing network
Created 2019-05-19
1,777 commits to master branch, last one 3 days ago
22
157
apache-2.0
8
Rust implementation of Apache Iceberg with integration for DataFusion
Created 2022-12-20
1,187 commits to main branch, last one 5 days ago
Batteries included CLI, TUI, and server implementations for DataFusion.
Created 2022-03-18
333 commits to main branch, last one 3 days ago
7
130
apache-2.0
4
Query and transform data with PRQL
This repository has been archived
Created 2022-10-11
140 commits to main branch, last one 2 years ago
10x lower latency for cloud-native DataFusion
Created 2024-12-17
176 commits to main branch, last one 6 hours ago
Java binding to Apache DataFusion
Created 2021-10-12
169 commits to main branch, last one about a month ago
7
73
mit
5
A lightweight Logging and Tracing observability solution for Rust, built with Apache Arrow, Apache Parquet and Apache DataFusion.
Created 2022-01-03
229 commits to master branch, last one 6 months ago
14
70
unknown
3
etl engine: a lightweight, cross-platform ETL engine with unified stream and batch processing for data extraction, transformation, and loading
Created 2022-04-21
239 commits to main branch, last one 4 months ago
S3 as an ObjectStore for DataFusion
Created 2022-01-04
108 commits to main branch, last one 2 years ago
5
59
other
2
Exon is an OLAP query engine specifically for biology and life science applications.
Created 2023-05-28
896 commits to main branch, last one 2 months ago
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Created 2022-06-19
515 commits to main branch, last one 8 days ago
Community InfluxDB 3.0 "IOx" static builds + containers + Examples for Developers & Integrators. Experiment with low-cost storage, unlimited cardinality and FlightSQL APIs
Created 2023-05-20
201 commits to main branch, last one 17 days ago
Awesome list of alternative dataframe libraries in Python.
Created 2021-11-21
13 commits to main branch, last one 2 years ago
Blazing-Fast Bioinformatic Operations on Python DataFrames
Created 2024-11-26
88 commits to master branch, last one 5 days ago
Scale to zero Seafowl hosting with Cloud Run
Created 2023-04-22
10 commits to master branch, last one about a year ago
6
38
apache-2.0
5
Experimental Elixir bindings for Apache Arrow including Parquet and DataFusion
Created 2021-03-20
62 commits to main branch, last one 4 years ago
Incremental view maintenance & query rewriting for materialized views in DataFusion
Created 2024-12-23
22 commits to main branch, last one 5 days ago