Statistics for topic datasets

RepositoryStats tracks 603,127 GitHub repositories, of which 365 are tagged with the datasets topic. The most common primary language for repositories using this topic is Python (154). Other languages include Jupyter Notebook (39).

Stargazers over time for topic datasets

[Chart: vertical axis from 0 to 90, horizontal axis from 2020 to 2024]

Most starred repositories for topic datasets

A topic-centric list of HQ open datasets.
Created 2014-11-20
781 commits to master branch, last one about a month ago
2.5k
20.2k
apache-2.0
179

Label Studio is a multi-type data labeling and annotation tool with standardized output format
Created 2019-06-19
4,224 commits to develop branch, last one a day ago
2.7k
19.5k
apache-2.0
278

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Created 2020-03-26
3,974 commits to main branch, last one 2 days ago
1.4k
12.0k
apache-2.0
1.1k

pix2code: Generating Code from a Graphical User Interface Screenshot
Created 2017-05-24
26 commits to master branch, last one 3 years ago
2.0k
10.0k
mit
212

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Created 2019-10-01
8,698 commits to main branch, last one 19 hours ago
754
9.9k
agpl-3.0
88

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,749 commits to master branch, last one 13 days ago

Trending repositories for topic datasets