26 results found Sort:

5.4k
16.1k
apache-2.0
855
The official home of the Presto distributed SQL query engine for big data
Created 2012-08-09
23,575 commits to master branch, last one 20 hours ago
3.3k
12.8k
apache-2.0
286
Apache Doris is an easy-to-use, high performance and unified analytics database.
Created 2017-08-10
23,090 commits to master branch, last one 24 hours ago
1.8k
9.0k
apache-2.0
208
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for...
Created 2021-09-04
19,012 commits to main branch, last one 19 hours ago
752
7.9k
other
94
𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
Created 2020-10-10
32,104 commits to main branch, last one 20 hours ago
424
2.4k
apache-2.0
248
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Created 2021-12-28
1,151 commits to main branch, last one 24 hours ago
332
2.2k
apache-2.0
57
ByConity is an open source cloud data warehouse
Created 2022-12-22
72,389 commits to master branch, last one a day ago
136
1.9k
apache-2.0
39
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Created 2022-12-05
76,353 commits to main branch, last one 16 hours ago
341
1.1k
apache-2.0
27
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Created 2023-04-23
1,937 commits to main branch, last one a day ago
290
873
apache-2.0
35
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Created 2022-07-14
1,491 commits to master branch, last one 6 hours ago
15
382
postgresql
2
DuckDB-powered analytics for Postgres
Created 2024-05-09
83 commits to dev branch, last one 2 days ago
28
285
apache-2.0
12
Use SQL to build ELT pipelines on a data lakehouse.
Created 2021-03-11
481 commits to main branch, last one 2 years ago
82
235
apache-2.0
13
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Created 2022-03-08
1,130 commits to main branch, last one 18 hours ago
15
234
apache-2.0
2
Lakekeeper: A Rust native Iceberg REST Catalog
Created 2024-04-05
345 commits to main branch, last one a day ago
52
226
other
20
AI 时代的智能数据库
Created 2019-07-16
221 commits to master branch, last one about a year ago
38
224
apache-2.0
18
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...
Created 2022-11-11
17 commits to master branch, last one 23 days ago
Iceberg/Delta Columnstore Table in Postgres
Created 2024-09-05
35 commits to main branch, last one a day ago
Examples of using Terraform to deploy Databricks resources
Created 2022-06-10
173 commits to main branch, last one 3 days ago
19
166
apache-2.0
12
Pure Rust Iceberg Implementation
This repository has been archived (exclude archived)
Created 2023-06-15
185 commits to main branch, last one 3 months ago
7
149
apache-2.0
9
Unified storage framework for the entire machine learning lifecycle
Created 2023-12-15
99 commits to main branch, last one 8 months ago
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Created 2024-02-22
16 commits to main branch, last one 14 days ago
9
66
apache-2.0
2
Lakehouse storage system benchmark
Created 2022-12-15
42 commits to main branch, last one about a year ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created 2023-04-04
15 commits to main branch, last one about a year ago
The open-source, AI-native data stack
Created 2024-09-10
11 commits to main branch, last one 2 months ago
A curated list of awesome Online Analytical Processing databases, frameworks, ressources and other awesomeness.
Created 2023-08-27
4 commits to main branch, last one about a year ago
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
Created 2022-05-13
10 commits to master branch, last one 11 months ago
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-...
Created 2021-04-12
165 commits to master branch, last one 2 years ago