15 results found Sort:

1.9k
8.4k
apache-2.0
173
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Created 2017-08-05
4,832 commits to dev branch, last one 3 days ago
78
2.9k
mit
20
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Created 2024-02-12
1,037 commits to main branch, last one 4 days ago
1.0k
2.7k
apache-2.0
74
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Created 2022-01-12
4,075 commits to master branch, last one a day ago
165
2.5k
apache-2.0
44
Concurrent and multi-stage data ingestion and data processing with Elixir
Created 2018-11-05
415 commits to main branch, last one a day ago
408
2.0k
apache-2.0
106
Pravega - Streaming as a new software defined storage primitive
Created 2016-07-11
3,297 commits to master branch, last one about a month ago
33
915
apache-2.0
8
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Created 2023-08-03
2,677 commits to main branch, last one a day ago
Copy to/from Parquet in S3 or Azure Blob Storage from within PostgreSQL
Created 2024-09-04
66 commits to main branch, last one 3 days ago
11
317
other
4
Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.
Created 2022-09-26
6,433 commits to develop branch, last one 22 days ago
28
285
apache-2.0
11
Use SQL to build ELT pipelines on a data lakehouse.
Created 2021-03-11
481 commits to main branch, last one 3 years ago
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
Created 2022-02-11
130 commits to main branch, last one 2 months ago
35
112
apache-2.0
12
Apache Paimon Rust The rust implementation of Apache Paimon.
Created 2024-07-05
40 commits to main branch, last one 6 months ago
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
Created 2021-01-07
226 commits to main branch, last one about a year ago
49
101
apache-2.0
17
Apache Spark examples exclusively in Java
Created 2016-06-26
215 commits to master branch, last one 3 years ago
7
69
apache-2.0
3
The modular, open-source backend for building AI-native software — powered by knowledge, not static data.
Created 2025-04-02
93 commits to main branch, last one 2 days ago