Statistics for topic datasets

RepositoryStats tracks 605,968 Github repositories, of these 366 are tagged with the datasets topic. The most common primary language for repositories using this topic is Python (155). Other languages include: Jupyter Notebook (39)

Stargazers over time for topic datasets

Most starred repositories for topic datasets (view more)

A topic-centric list of HQ open datasets.
Created 2014-11-20
781 commits to master branch, last one 2 months ago
2.5k
20.3k
apache-2.0
180
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Created 2019-06-19
4,272 commits to develop branch, last one 5 hours ago
2.7k
19.5k
apache-2.0
278
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Created 2020-03-26
3,976 commits to main branch, last one 4 days ago
1.4k
12.0k
apache-2.0
1.1k
pix2code: Generating Code from a Graphical User Interface Screenshot
Created 2017-05-24
26 commits to master branch, last one 3 years ago
790
10.1k
agpl-3.0
88
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,750 commits to master branch, last one 9 days ago
2.0k
10.1k
mit
210
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Created 2019-10-01
8,728 commits to main branch, last one 7 hours ago

Trending repositories for topic datasets (view more)