30 results found Sort:

2.8k
9.7k
apache-2.0
169
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Created 2019-01-19
38,004 commits to master branch, last one 11 hours ago
578
5.8k
mit
38
Python SQL Parser and Transpiler
Created 2021-03-13
4,385 commits to main branch, last one a day ago
539
4.4k
apache-2.0
82
the portable Python dataframe library
Created 2015-04-17
8,168 commits to main branch, last one 15 hours ago
240
2.9k
mit
22
Compare tables within or across databases
This repository has been archived (exclude archived)
Created 2022-03-07
1,932 commits to master branch, last one 14 days ago
89
1.7k
apache-2.0
37
Logica is a logic programming language that compiles to SQL. It runs on Google BigQuery, PostgreSQL and SQLite.
Created 2020-10-09
884 commits to main branch, last one a day ago
288
1.1k
apache-2.0
32
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Created 2019-07-17
1,407 commits to master branch, last one a day ago
81
810
apache-2.0
11
DataCap is integrated software for data transformation, integration, and visualization. Support a variety of data sources, file types, big data related database, relational database, NoSQL database, e...
Created 2022-09-17
1,775 commits to dev branch, last one 8 days ago
200
623
apache-2.0
29
Web UI for Trino, Hive and SparkSQL
Created 2015-02-02
1,088 commits to master branch, last one 2 years ago
54
413
apache-2.0
16
One framework to develop, deploy and operate data workflows with Python and SQL.
Created 2021-07-20
2,131 commits to main branch, last one a day ago
49
386
apache-2.0
10
Full platform database management tool, supports ClickHouse, Presto, Trino, MySQL, PostgreSQL, Apache Druid, ElasticSearch...
Created 2021-05-19
722 commits to dev branch, last one 7 months ago
159
383
apache-2.0
30
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
This repository has been archived (exclude archived)
Created 2019-11-06
298 commits to master branch, last one 10 months ago
87
376
bsd-3-clause
16
New Generation Opensource Data Stack Demo
Created 2022-07-03
57 commits to main branch, last one about a year ago
SQL Parsers for BigData, built with antlr4.
Created 2018-07-02
410 commits to main branch, last one about a month ago
Quix Notebook Manager
Created 2019-04-14
1,116 commits to master branch, last one 10 days ago
This repository has no description...
Created 2021-02-24
40 commits to main branch, last one about a month ago
49
191
apache-2.0
8
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
Created 2021-07-14
524 commits to master branch, last one 22 days ago
12
133
gpl-3.0
4
Trino: Master your translations with command line!
Created 2017-03-16
17 commits to master branch, last one 4 years ago
The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them
Created 2020-11-02
37 commits to main branch, last one 11 months ago
24
84
apache-2.0
5
Storage connector for Trino
Created 2018-12-15
273 commits to master branch, last one 20 hours ago
A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino
Created 2021-11-02
300 commits to main branch, last one 29 days ago
2
79
apache-2.0
3
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
Created 2022-07-21
673 commits to main branch, last one 2 months ago
Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database
Created 2021-05-23
37 commits to main branch, last one 2 years ago
Starburst Metabase driver
Created 2022-05-16
101 commits to main branch, last one 3 months ago
21
54
apache-2.0
3
Apache Hive Metastore as a Standalone server in Docker
Created 2022-09-09
15 commits to main branch, last one 7 months ago
7
54
bsd-3-clause
4
New generation opensource data stack
Created 2022-05-20
8 commits to main branch, last one 2 years ago
4
53
apache-2.0
5
A library that brings useful functions from various modern database management systems to Apache Spark
Created 2020-04-02
52 commits to main branch, last one 9 months ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created 2023-04-04
15 commits to main branch, last one 9 months ago
Code for my "Efficient Data Processing in SQL" book.
Created 2022-10-01
22 commits to main branch, last one 10 months ago
6
30
unknown
2
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Created 2023-08-27
37 commits to master branch, last one 8 months ago
7
28
unknown
2
A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Created 2023-02-22
39 commits to main branch, last one 5 months ago