Statistics for topic data-quality

RepositoryStats tracks 518,325 Github repositories, of these 59 are tagged with the data-quality topic. The most common primary language for repositories using this topic is Python (24). Other languages include: Jupyter Notebook (12)

Stargazers over time for topic data-quality

Most starred repositories for topic data-quality (view more)

5.8k
36.0k
mit
1.2k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Created 2018-11-05
18 commits to main branch, last one 5 months ago
3.5k
26.0k
mit
924
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Created 2020-07-04
485 commits to main branch, last one 14 days ago
1.6k
12.1k
mit
150
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Created 2016-01-09
1,444 commits to develop branch, last one 9 days ago
677
8.8k
agpl-3.0
85
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,618 commits to master branch, last one 2 days ago
502
6.8k
apache-2.0
53
The open-source tool for building high-quality datasets and computer vision models
Created 2020-04-22
20,234 commits to develop branch, last one 3 days ago

Trending repositories for topic data-quality (view more)