47 results found Sort:

2.2k
14.7k
apache-2.0
164
A React component for building Web forms from JSON Schema.
Created 2015-12-16
1,735 commits to main branch, last one 13 days ago
821
10.4k
agpl-3.0
87
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Created 2018-05-11
1,770 commits to master branch, last one 23 days ago
1.2k
6.4k
apache-2.0
48
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created 2021-08-01
12,158 commits to main branch, last one 20 hours ago
659
6.0k
apache-2.0
48
Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Created 2020-11-25
2,530 commits to main branch, last one a day ago
266
3.8k
other
22
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and model...
Created 2021-10-11
1,498 commits to main branch, last one about a month ago
330
3.7k
mit
20
A light-weight, flexible, and expressive statistical data testing library
Created 2018-11-01
837 commits to main branch, last one 2 days ago
239
3.2k
isc
50
Lightweight, extensible data validation library for Python
Created 2012-10-10
1,148 commits to 1.3.x branch, last one 3 months ago
226
2.1k
apache-2.0
15
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created 2020-12-14
788 commits to main branch, last one a day ago
Validation library with type-safe schemas and rules
Created 2015-07-22
1,950 commits to main branch, last one 15 days ago
72
1.1k
agpl-3.0
16
Automatically find issues in image datasets and practice data-centric computer vision.
Created 2022-05-26
338 commits to main branch, last one a day ago
58
943
other
31
Data quality assessment and metadata reporting for data frames and database tables
Created 2017-02-24
5,729 commits to main branch, last one 8 days ago
26
449
apache-2.0
10
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
Created 2022-09-21
654 commits to main branch, last one about a month ago
112
444
mit
18
Coercion and validation for data structures
Created 2017-04-17
1,758 commits to main branch, last one about a month ago
A proxy that validates responses and requests against an OpenAPI document. https://www.npmjs.com/package/openapi-cop https://hub.docker.com/r/lxlu/openapi-cop
Created 2020-02-19
564 commits to main branch, last one 2 years ago
12
361
mit
10
Elegant, highly efficient data validation for JavaScript.
Created 2015-03-31
102 commits to master branch, last one 6 years ago
11
330
apache-2.0
8
The data-validation toolkit for enhanced dbt (data build tool) PR review
Created 2023-10-06
2,274 commits to main branch, last one 2 days ago
174
286
unknown
22
Data Cleaning Libraries with Python
Created 2017-04-24
13 commits to master branch, last one 6 years ago
A dead simple Python string validation library.
Created 2017-06-07
46 commits to master branch, last one 5 years ago
Powerful CSV & Excel Import experience for SaaS πŸš€ Save months building data import experience from scratch πŸ’°
Created 2022-09-23
3,918 commits to next branch, last one 19 days ago
25
207
apache-2.0
6
βš“ Eurybia monitors model drift over time and securizes model deployment with data validation
Created 2022-05-02
124 commits to master branch, last one 5 months ago
Validator for the Brain Imaging Data Structure
Created 2015-06-09
5,186 commits to master branch, last one about a month ago
Typical: Fast, simple, & correct data-validation using Python 3 typing.
This repository has been archived (exclude archived)
Created 2019-03-15
535 commits to main branch, last one 6 months ago
41
157
apache-2.0
29
An RDF Unit Testing Suite
Created 2013-04-02
2,057 commits to master branch, last one about a year ago
24
120
apache-2.0
9
A collaborative framework for annotating medical datasets using crowdsourcing.
Created 2017-12-11
587 commits to master branch, last one 4 years ago
16
113
apache-2.0
1
Dingo: A Comprehensive Data Quality Evaluation Tool
Created 2024-12-24
94 commits to main branch, last one 4 days ago
A tool to validate data, built around Apache Spark.
Created 2019-04-17
502 commits to main branch, last one 6 days ago
7
100
mit
62
Declarative data validations.
Created 2017-02-10
306 commits to master branch, last one 2 years ago
Find out if your data is what you think it is
Created 2024-10-23
1,565 commits to main branch, last one 21 hours ago