9 results found Sort:

78
281
apache-2.0
20
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Created 2017-07-17
516 commits to main branch, last one about a month ago
15
88
apache-2.0
18
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Created 2017-11-17
267 commits to main branch, last one 2 years ago
27
71
apache-2.0
1
Apache Hive Metastore as a Standalone server in Docker
Created 2022-09-09
23 commits to main branch, last one 7 months ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created 2023-04-04
15 commits to main branch, last one about a year ago
A client for connecting and running DDLs on hive metastore.
Created 2020-11-19
66 commits to main branch, last one 2 years ago
7
46
apache-2.0
9
Service for automatically managing and cleaning up unreferenced data
Created 2019-08-06
301 commits to main branch, last one 11 days ago
Dockerizing an Apache Spark Standalone Cluster
Created 2021-07-19
35 commits to main branch, last one 2 years ago
13
43
unknown
1
A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Created 2023-02-22
40 commits to main branch, last one 3 months ago
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
Created 2024-08-08
108 commits to main branch, last one 5 months ago