Statistics for topic big-data

RepositoryStats tracks 518,986 Github repositories, of these 342 are tagged with the big-data topic. The most common primary language for repositories using this topic is Java (91). Other languages include: Python (55),  Scala (29),  Jupyter Notebook (24),  C++ (21),  JavaScript (16),  Go (13),  Rust (11),  TypeScript (11)

Stargazers over time for topic big-data

Most starred repositories for topic big-data (view more)

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Created 2017-12-27
1,182 commits to master branch, last one 3 days ago
28.0k
38.5k
apache-2.0
2.0k
Apache Spark - A unified analytics engine for large-scale data processing
Created 2014-02-25
41,039 commits to master branch, last one 16 hours ago
6.5k
34.6k
apache-2.0
684
ClickHouse® is a free analytics DBMS for big data
Created 2016-06-02
141,862 commits to master branch, last one 15 hours ago
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...
Created 2015-01-23
543 commits to master branch, last one 5 years ago
13.0k
23.2k
apache-2.0
945
Apache Flink
Created 2014-06-07
35,303 commits to master branch, last one 17 hours ago
1.1k
17.8k
other
320
An open source cybersecurity protocol for syncing decentralized graph data.
Created 2014-07-31
2,498 commits to master branch, last one about a month ago

Trending repositories for topic big-data (view more)