Statistics for topic datasets

RepositoryStats tracks 584,797 GitHub repositories, of which 356 are tagged with the datasets topic. The most common primary language for repositories using this topic is Python (150). Other languages include Jupyter Notebook (39).

Stargazers over time for topic datasets

Most starred repositories for topic datasets

A topic-centric list of HQ open datasets.
Created 2014-11-20
781 commits to master branch, last one 7 days ago
2.4k
19.4k
apache-2.0
177

Label Studio is a multi-type data labeling and annotation tool with standardized output format
Created 2019-06-19
4,072 commits to develop branch, last one 16 hours ago
2.7k
19.3k
apache-2.0
279

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Created 2020-03-26
3,954 commits to main branch, last one 2 days ago
1.4k
12.0k
apache-2.0
1.1k

pix2code: Generating Code from a Graphical User Interface Screenshot
Created 2017-05-24
26 commits to master branch, last one 3 years ago
751
9.8k
agpl-3.0
90

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,743 commits to master branch, last one 28 days ago
1.7k
9.6k
mit
134

Open source annotation tool for machine learning practitioners.
Created 2018-05-09
3,535 commits to master branch, last one 2 months ago

Trending repositories for topic datasets