Statistics for topic hadoop

RepositoryStats tracks 635,089 GitHub repositories, of which 188 are tagged with the hadoop topic. The most common primary language for repositories using this topic is Java (82). Other languages include Python (22), Scala (17), Shell (12), and Jupyter Notebook (11).

Stargazers over time for topic hadoop

Most starred repositories for topic hadoop

data-science-ipython-notebooks donnemartin

7.9k

28.1k

other

1.6k

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...

aws caffe keras numpy scipy spark hadoop kaggle pandas python theano big-data mapreduce matplotlib tensorflow data-science scikit-learn deep-learning machine-learning

Created 2015-01-23

543 commits to master branch, last one 6 years ago

luigi spotify

2.4k

18.2k

apache-2.0

466

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

luigi hadoop python scheduling orchestration-framework

Created 2012-09-20

4,207 commits to master branch, last one 2 months ago

APIJSON Tencent

2.2k

17.7k

other

385

🏆 Real-time, coding-free, powerful and secure ORM 🚀 providing APIs and docs without coding by the backend, and the returned JSON of API can...