Statistics for topic data-quality

RepositoryStats tracks 643,405 Github repositories, of these 81 are tagged with the data-quality topic. The most common primary language for repositories using this topic is Python (35). Other languages include: Jupyter Notebook (15)

Stargazers over time for topic data-quality

Most starred repositories for topic data-quality (view more)

Made-With-ML GokuMohandas

6.1k

38.4k

mit

1.2k

Learn how to design, develop, deploy and iterate on production-grade ML applications.

ray llms mlops python pytorch data-quality data-science deep-learning distributed-ml data-engineering machine-learning distributed-training natural-language-processing

Created 2018-11-05

18 commits to main branch, last one about a year ago

applied-ml eugeneyan

3.7k

27.9k

mit

951

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

recsys search production data-quality data-science deep-learning data-discovery computer-vision data-engineering machine-learning applied-data-science reinforcement-learning applied-machine-learning natural-language-processing

Created 2020-07-04

485 commits to main branch, last one 11 months ago

ydata-profiling ydataai

1.7k

12.9k

mit

149

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Created 2016-01-09

1,575 commits to develop branch, last one about a month ago

cleanlab cleanlab

825

10.5k

agpl-3.0

87

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Created 2018-05-11

1,772 commits to master branch, last one 16 days ago

great_expectations great-expectations

1.6k

10.3k

apache-2.0

83

Always know what to expect from your data.

Created 2017-09-11

13,120 commits to develop branch, last one 22 hours ago

fiftyone voxel51

625

9.4k

apache-2.0

65

Refine high-quality datasets and visual AI models

python data-quality data-science data-cleaning data-curation deep-learning vector-search visualization active-learning computer-vision data-centric-ai developer-tools machine-learning object-detection unstructured-data image-classification artificial-intelligence

Created 2020-04-22

23,246 commits to develop branch, last one 8 hours ago

Trending repositories for topic data-quality (view more)