15 results found Sort:
- Filter by Primary Language:
- Java (4)
- Python (3)
- JavaScript (2)
- Rust (2)
- TypeScript (2)
- Elixir (1)
- Go (1)
- +
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Created
2017-08-05
4,832 commits to dev branch, last one 3 days ago
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Created
2024-02-12
1,037 commits to main branch, last one 4 days ago
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Created
2022-01-12
4,075 commits to master branch, last one a day ago
Concurrent and multi-stage data ingestion and data processing with Elixir
Created
2018-11-05
415 commits to main branch, last one a day ago
Pravega - Streaming as a new software defined storage primitive
Created
2016-07-11
3,297 commits to master branch, last one about a month ago
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Created
2023-08-03
2,677 commits to main branch, last one a day ago
Copy to/from Parquet in S3 or Azure Blob Storage from within PostgreSQL
Created
2024-09-04
66 commits to main branch, last one 3 days ago
Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.
Created
2022-09-26
6,433 commits to develop branch, last one 22 days ago
Use SQL to build ELT pipelines on a data lakehouse.
Created
2021-03-11
481 commits to main branch, last one 3 years ago
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
Created
2022-02-11
130 commits to main branch, last one 2 months ago
Apache Paimon Rust The rust implementation of Apache Paimon.
Created
2024-07-05
40 commits to main branch, last one 6 months ago
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
Created
2021-01-07
226 commits to main branch, last one about a year ago
Apache Spark examples exclusively in Java
Created
2016-06-26
215 commits to master branch, last one 3 years ago
The modular, open-source backend for building AI-native software — powered by knowledge, not static data.
Created
2025-04-02
93 commits to main branch, last one 2 days ago
Squirrel dataset hub
This repository has been archived
(exclude archived)
Created
2022-02-01
118 commits to main branch, last one about a year ago