9 results found Sort:

101
1.5k
apache-2.0
22
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
Created 2022-07-03
575 commits to main branch, last one 6 months ago
149
927
apache-2.0
28
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Created 2023-07-21
270 commits to main branch, last one 5 days ago
28
285
apache-2.0
12
Use SQL to build ELT pipelines on a data lakehouse.
Created 2021-03-11
481 commits to main branch, last one 2 years ago
9
66
apache-2.0
2
Lakehouse storage system benchmark
Created 2022-12-15
42 commits to main branch, last one about a year ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created 2023-04-04
15 commits to main branch, last one about a year ago
The open-source, AI-native data stack
Created 2024-09-10
12 commits to main branch, last one 11 days ago
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Created 2022-02-02
4 commits to main branch, last one 2 years ago
2
25
apache-2.0
3
An open-source, community-driven REST catalog for Apache Iceberg!
Created 2024-06-18
6 commits to main branch, last one 5 months ago
Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS
Created 2023-01-19
43 commits to main branch, last one about a month ago