12 results found Sort:
- Filter by Primary Language:
- Python (3)
- Go (2)
- JavaScript (1)
- Jupyter Notebook (1)
- Rust (1)
- Scala (1)
- Dockerfile (1)
- TypeScript (1)
- Java (1)
- +
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
Created
2022-07-03
576 commits to main branch, last one 2 months ago
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Created
2023-07-21
302 commits to main branch, last one 9 days ago
Fastest open-source tool for replicating Databases to Apache Iceberg or Data Lakehouse. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB and MySQL
Created
2024-10-15
196 commits to master branch, last one 7 days ago
Use SQL to build ELT pipelines on a data lakehouse.
Created
2021-03-11
481 commits to main branch, last one 2 years ago
The open-source, AI-native data stack
Created
2024-09-10
688 commits to main branch, last one a day ago
Lakehouse storage system benchmark
Created
2022-12-15
42 commits to main branch, last one 2 years ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created
2023-04-04
15 commits to main branch, last one about a year ago
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Created
2022-02-02
4 commits to main branch, last one 3 years ago
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
Created
2025-01-12
6 commits to main branch, last one 2 months ago
Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS
Created
2023-01-19
44 commits to main branch, last one about a month ago
An open-source, community-driven REST catalog for Apache Iceberg!
Created
2024-06-18
6 commits to main branch, last one 8 months ago
Sample code to collect Apache Iceberg metrics for table monitoring
Created
2024-04-17
6 commits to main branch, last one 9 months ago