139 results found Sort:

1.7k
17.3k
other
154
📊 Cube — The Semantic Layer for Building Data Applications
Created 2018-09-16
8,444 commits to master branch, last one 19 hours ago
2.1k
16.8k
other
386
🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.
Created 2016-11-21
3,188 commits to master branch, last one 5 days ago
5.3k
15.7k
apache-2.0
862
The official home of the Presto distributed SQL query engine for big data
Created 2012-08-09
22,862 commits to master branch, last one 13 hours ago
4.2k
15.4k
unknown
443
大数据入门指南 :star:
Created 2019-03-10
607 commits to master branch, last one about a year ago
1.6k
14.0k
apache-2.0
101
🔥🔥🔥AI-driven data management platform Over 1 million developers are using Chat2DB
Created 2023-06-20
3,598 commits to main branch, last one a day ago
3.1k
11.6k
apache-2.0
283
Apache Doris is an easy-to-use, high performance and unified analytics database.
Created 2017-08-10
19,618 commits to master branch, last one 16 hours ago
2.8k
9.7k
apache-2.0
169
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Created 2019-01-19
38,004 commits to master branch, last one 13 hours ago
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Created 2019-02-14
405 commits to master branch, last one about a year ago
578
5.8k
mit
38
Python SQL Parser and Transpiler
Created 2021-03-13
4,385 commits to main branch, last one a day ago
4.6k
5.4k
apache-2.0
336
Apache Hive
Created 2009-05-21
17,292 commits to master branch, last one a day ago
387
3.9k
apache-2.0
59
Lightweight and blazing fast key-value database written in pure Dart.
Created 2019-07-08
654 commits to main branch, last one 9 months ago
700
3.4k
apache-2.0
20
🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~
Created 2022-05-12
19 commits to master branch, last one 2 years ago
1.1k
3.3k
apache-2.0
262
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Created 2019-07-23
4,154 commits to master branch, last one 8 days ago
992
3.0k
apache-2.0
181
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Created 2019-11-24
3,600 commits to master branch, last one 2 months ago
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Created 2018-04-18
27 commits to master branch, last one 5 years ago
844
2.5k
unknown
46
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Created 2019-11-30
848 commits to master branch, last one 5 months ago
605
2.2k
other
61
深圳地铁大数据客流分析系统🚇🚄🌟
Created 2020-04-13
288 commits to master branch, last one 4 months ago
582
2.1k
mit
123
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Created 2019-01-04
202 commits to master branch, last one about a year ago
864
2.0k
apache-2.0
64
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Created 2017-12-18
3,972 commits to master branch, last one 2 days ago
979
1.9k
apache-2.0
155
Apache Drill is a distributed MPP query layer for self describing data
Created 2012-09-05
4,501 commits to master branch, last one 14 days ago
219
1.8k
apache-2.0
34
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
Created 2020-03-05
984 commits to master branch, last one 10 days ago
551
1.7k
other
62
Python interface to Hive and Presto. 🐝
Created 2014-02-01
189 commits to master branch, last one a day ago
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Created 2020-06-10
1,054 commits to master branch, last one 3 days ago
220
1.3k
unknown
48
后端开发常用框架文档及中文翻译,包含 Spring 系列文档(Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session),大数据(Apache Hive, HBase, Apache Flume),日志(Log4j2, Logback),Http Server(NGINX,Apache),Python,数据库(OpenTS...
Created 2018-12-25
56 commits to master branch, last one 2 years ago
320
1.3k
apache-2.0
33
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Created 2021-03-02
1,460 commits to master branch, last one 13 days ago
288
1.1k
apache-2.0
32
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Created 2019-07-17
1,407 commits to master branch, last one a day ago
More than 2000+ Data engineer interview questions.
Created 2021-08-08
16 commits to master branch, last one 12 days ago
81
810
apache-2.0
11
DataCap is integrated software for data transformation, integration, and visualization. Support a variety of data sources, file types, big data related database, relational database, NoSQL database, e...
Created 2022-09-17
1,775 commits to dev branch, last one 8 days ago
266
800
apache-2.0
74
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Created 2019-07-23
118 commits to master branch, last one 2 months ago
Uses tokenized query returned by python-sqlparse and generates query metadata
Created 2017-06-06
725 commits to master branch, last one 11 days ago