Statistics for topic big-data

RepositoryStats tracks 635,692 Github repositories, of these 371 are tagged with the big-data topic. The most common primary language for repositories using this topic is Java (94). Other languages include: Python (60),  Scala (32),  Jupyter Notebook (27),  C++ (22),  Rust (18),  JavaScript (15),  Go (14),  TypeScript (14)

Stargazers over time for topic big-data

450450400400350350300300250250200200150150100100505000202020202021202120222022202320232024202420252025

Most starred repositories for topic big-data (view more)

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Created 2017-12-27
1,199 commits to master branch, last one 20 days ago
28.5k
40.9k
apache-2.0
2.0k
Apache Spark - A unified analytics engine for large-scale data processing
Created 2014-02-25
44,083 commits to master branch, last one a day ago
7.2k
39.9k
apache-2.0
690
ClickHouse® is a real-time analytics database management system
Created 2016-06-02
172,425 commits to master branch, last one a day ago
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...
Created 2015-01-23
543 commits to master branch, last one 6 years ago
13.5k
24.7k
apache-2.0
930
Apache Flink
Created 2014-06-07
36,705 commits to master branch, last one a day ago
1.2k
18.4k
other
319
An open source cybersecurity protocol for syncing decentralized graph data.
Created 2014-07-31
2,525 commits to master branch, last one 3 days ago

Trending repositories for topic big-data (view more)