Statistics for topic datasets

RepositoryStats tracks 579,129 Github repositories, of these 356 are tagged with the datasets topic. The most common primary language for repositories using this topic is Python (150). Other languages include: Jupyter Notebook (39)

Stargazers over time for topic datasets

Most starred repositories for topic datasets (view more)

A topic-centric list of HQ open datasets.
Created 2014-11-20
778 commits to master branch, last one 12 days ago
2.7k
19.2k
apache-2.0
280
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Created 2020-03-26
3,953 commits to main branch, last one 2 days ago
2.4k
19.2k
apache-2.0
176
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Created 2019-06-19
3,995 commits to develop branch, last one 8 hours ago
1.4k
12.0k
apache-2.0
1.1k
pix2code: Generating Code from a Graphical User Interface Screenshot
Created 2017-05-24
26 commits to master branch, last one 3 years ago
752
9.7k
agpl-3.0
90
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,743 commits to master branch, last one 14 days ago
1.7k
9.5k
mit
134
Open source annotation tool for machine learning practitioners.
Created 2018-05-09
3,535 commits to master branch, last one 2 months ago

Trending repositories for topic datasets (view more)