Statistics for topic dataset

RepositoryStats tracks 650,729 Github repositories, of these 1,253 are tagged with the dataset topic. The most common primary language for repositories using this topic is Python (667). Other languages include: Jupyter Notebook (165),  C++ (25),  JavaScript (23),  HTML (19),  R (16),  MATLAB (15),  TypeScript (12),  Shell (11)

Stargazers over time for topic dataset

Most starred repositories for topic dataset (view more)

35.8k
339.5k
mit
4.3k
A collective list of free APIs
Created 2016-03-20
4,535 commits to master branch, last one 6 months ago
2.7k
22.1k
apache-2.0
182
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Created 2019-06-19
4,831 commits to develop branch, last one 14 hours ago
2.0k
18.4k
mit
224
Faker is a Python package that generates fake data for you.
Created 2012-11-12
3,990 commits to master branch, last one 18 hours ago
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created 2020-12-11
324 commits to main branch, last one 3 months ago
3.2k
13.7k
mit
183
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Created 2018-06-29
5,157 commits to develop branch, last one about an hour ago
A MNIST-like fashion product database. Benchmark :point_down:
Created 2017-08-25
224 commits to master branch, last one 3 years ago

Trending repositories for topic dataset (view more)