27 results found Sort:
- Filter by Primary Language:
- Java (7)
- Python (4)
- Rust (4)
- C++ (3)
- Scala (2)
- TypeScript (1)
- Dockerfile (1)
- Go (1)
- HCL (1)
- JavaScript (1)
- +
The official home of the Presto distributed SQL query engine for big data
Created
2012-08-09
23,718 commits to master branch, last one a day ago
Apache Doris is an easy-to-use, high performance and unified analytics database.
Created
2017-08-10
24,003 commits to master branch, last one a day ago
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for...
Created
2021-09-04
19,525 commits to main branch, last one 2 days ago
๐๐ฎ๐๐ฎ, ๐๐ป๐ฎ๐น๐๐๐ถ๐ฐ๐ & ๐๐. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
Created
2020-10-10
32,327 commits to main branch, last one a day ago
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Created
2021-12-28
1,163 commits to main branch, last one 3 days ago
ByConity is an open source cloud data warehouse
Created
2022-12-22
72,496 commits to master branch, last one about a month ago
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Created
2022-12-05
77,562 commits to main branch, last one 16 hours ago
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Created
2023-04-23
2,183 commits to main branch, last one a day ago
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Created
2022-07-14
1,516 commits to master branch, last one 2 days ago
DuckDB-powered data lake analytics from Postgres
Created
2024-05-09
94 commits to dev branch, last one a day ago
Lakekeeper: A Rust native Iceberg REST Catalog
Created
2024-04-05
510 commits to main branch, last one 3 days ago
Iceberg/Delta Columnstore Table in Postgres
Created
2024-09-05
74 commits to main branch, last one 2 days ago
Use SQL to build ELT pipelines on a data lakehouse.
Created
2021-03-11
481 commits to main branch, last one 2 years ago
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Created
2022-03-08
1,158 commits to main branch, last one 19 days ago
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...
Created
2022-11-11
17 commits to master branch, last one 2 months ago
AI ๆถไปฃ็ๆบ่ฝๆฐๆฎๅบ
Created
2019-07-16
221 commits to master branch, last one about a year ago
Examples of using Terraform to deploy Databricks resources
Created
2022-06-10
183 commits to main branch, last one 15 days ago
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Created
2024-02-22
16 commits to main branch, last one 2 months ago
Pure Rust Iceberg Implementation
This repository has been archived
(exclude archived)
Created
2023-06-15
185 commits to main branch, last one 5 months ago
Unified storage framework for the entire machine learning lifecycle
Created
2023-12-15
99 commits to main branch, last one 10 months ago
Fastest open-source tool for replicating Databases to Apache Iceberg or Data Lakehouse. โก Efficient, quick and scalable data ingestion for real-time analytics. Starting with MongoDB
Created
2024-10-15
170 commits to master branch, last one 5 days ago
The open-source, AI-native data stack
Created
2024-09-10
57 commits to main branch, last one 2 days ago
Lakehouse storage system benchmark
Created
2022-12-15
42 commits to main branch, last one about a year ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created
2023-04-04
15 commits to main branch, last one about a year ago
A curated list of awesome Online Analytical Processing databases, frameworks, ressources and other awesomeness.
Created
2023-08-27
6 commits to main branch, last one 7 days ago
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
Created
2022-05-13
10 commits to master branch, last one about a year ago
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-...
Created
2021-04-12
165 commits to master branch, last one 2 years ago