9 results found Sort:

77
277
apache-2.0
21
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Created 2017-07-17
516 commits to main branch, last one 8 days ago
15
88
apache-2.0
19
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Created 2017-11-17
267 commits to main branch, last one 2 years ago
27
68
apache-2.0
1
Apache Hive Metastore as a Standalone server in Docker
Created 2022-09-09
23 commits to main branch, last one 6 months ago
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
Created 2023-04-04
15 commits to main branch, last one about a year ago
A client for connecting and running DDLs on hive metastore.
Created 2020-11-19
66 commits to main branch, last one 2 years ago
7
46
apache-2.0
10
Service for automatically managing and cleaning up unreferenced data
Created 2019-08-06
298 commits to main branch, last one 22 days ago
Dockerizing an Apache Spark Standalone Cluster
Created 2021-07-19
35 commits to main branch, last one 2 years ago
12
40
unknown
1
A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Created 2023-02-22
40 commits to main branch, last one 2 months ago
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
Created 2024-08-08
108 commits to main branch, last one 4 months ago