Statistics for topic big-data
RepositoryStats tracks 561,412 GitHub repositories, of which 352 are tagged with the big-data topic. The most common primary language for repositories using this topic is Java (93). Other languages include Python (57), Scala (29), Jupyter Notebook (26), C++ (21), JavaScript (15), Rust (14), Go (13), and TypeScript (12).
Stargazers over time for topic big-data
Most starred repositories for topic big-data
Trending repositories for topic big-data
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
Apache Paimon Rust: the Rust implementation of Apache Paimon.
Bigtop Manager provides a modern, low-threshold web application to simplify the deployment and management of components for Bigtop, similar to Apache Ambari and Cloudera Manager.
An open-source, high-performance SQL vector database built on ClickHouse.
Yet another repository with basic concepts, technical challenges, and resources on data engineering in Spanish 🧙✨
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.