9 results found
- Filter by Primary Language:
- Java (3)
- Python (2)
- Dockerfile (1)
- Jupyter Notebook (1)
- Thrift (1)
- VBA (1)
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Created
2017-07-17
516 commits to main branch, last one 8 days ago
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Created
2017-11-17
267 commits to main branch, last one 2 years ago
Apache Hive Metastore as a Standalone server in Docker
Created
2022-09-09
23 commits to main branch, last one 6 months ago
Sample data lakehouse deployed in Docker containers using Apache Iceberg, MinIO, Trino and a Hive Metastore. Can be used for local testing.
Created
2023-04-04
15 commits to main branch, last one about a year ago
A client for connecting to a Hive Metastore and running DDL statements against it.
Created
2020-11-19
66 commits to main branch, last one 2 years ago
Service for automatically managing and cleaning up unreferenced data
Created
2019-08-06
298 commits to main branch, last one 22 days ago
Dockerizing an Apache Spark Standalone Cluster
Created
2021-07-19
35 commits to main branch, last one 2 years ago
A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Created
2023-02-22
40 commits to main branch, last one 2 months ago
End-to-end data platform: a PoC data platform project built on a modern data stack (Spark, Airflow, dbt, Trino, Lightdash, Hive Metastore, MinIO, Postgres)
Created
2024-08-08
108 commits to main branch, last one 4 months ago