9 results found Sort:
- Filter by Primary Language:
- Dockerfile (1)
- Go (1)
- HCL (1)
- Java (1)
- JavaScript (1)
- Jupyter Notebook (1)
- Python (1)
- Rust (1)
- Scala (1)
- +
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
Created
2022-07-03
575 commits to main branch, last one 6 months ago
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Created
2023-07-21
270 commits to main branch, last one 5 days ago
Use SQL to build ELT pipelines on a data lakehouse.
Created
2021-03-11
481 commits to main branch, last one 2 years ago
Lakehouse storage system benchmark
Created
2022-12-15
42 commits to main branch, last one about a year ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created
2023-04-04
15 commits to main branch, last one about a year ago
The open-source, AI-native data stack
Created
2024-09-10
12 commits to main branch, last one 11 days ago
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Created
2022-02-02
4 commits to main branch, last one 2 years ago
An open-source, community-driven REST catalog for Apache Iceberg!
Created
2024-06-18
6 commits to main branch, last one 5 months ago
Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS
Created
2023-01-19
43 commits to main branch, last one about a month ago