Statistics for topic data-mining

RepositoryStats tracks 631,840 Github repositories, of these 295 are tagged with the data-mining topic. The most common primary language for repositories using this topic is Python (134). Other languages include: Jupyter Notebook (40),  C++ (11),  HTML (11),  JavaScript (11)

Stargazers over time for topic data-mining

300300250250200200150150100100505000202020202021202120222022202320232024202420252025

Most starred repositories for topic data-mining (view more)

3.3k
26.1k
apache-2.0
318
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Created 2020-03-14
618 commits to master branch, last one 6 months ago
:memo: An awesome Data Science repository to learn and apply for real world problems.
Created 2014-07-05
1,110 commits to live branch, last one 19 days ago
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
Created 2017-02-05
374 commits to master branch, last one 5 years ago
3.9k
17.1k
mit
435
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
Created 2016-08-05
3,650 commits to master branch, last one 22 days ago
4.4k
15.9k
lgpl-2.1
426
Topic Modelling for Humans
Created 2011-02-10
4,536 commits to develop branch, last one about a month ago

Trending repositories for topic data-mining (view more)