28 results found Sort:
- Filter by Primary Language:
- Python (11)
- Java (8)
- HTML (2)
- Kotlin (1)
- Shell (1)
- TypeScript (1)
- +
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created
2021-08-01
9,664 commits to main branch, last one 15 hours ago
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created
2020-12-14
696 commits to main branch, last one 7 hours ago
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created
2021-08-30
4,878 commits to master branch, last one a day ago
Collect, aggregate, and visualize a data ecosystem's metadata
Created
2018-07-05
2,728 commits to main branch, last one 2 days ago
SQL Lineage Analysis Tool powered by Python
Created
2019-05-21
375 commits to master branch, last one about a month ago
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Created
2021-07-07
818 commits to main branch, last one 21 hours ago
Egeria core
Created
2018-05-31
20,665 commits to main branch, last one a day ago
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created
2023-05-13
141 commits to main branch, last one a day ago
Metrics Observability & Troubleshooting
Created
2023-04-21
404 commits to main branch, last one 3 months ago
Generate and Visualize Data Lineage from query history
Created
2020-03-17
100 commits to master branch, last one 10 months ago
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Created
2021-11-12
488 commits to main branch, last one 15 hours ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Created
2020-06-09
57 commits to master branch, last one 6 months ago
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
Created
2019-05-24
102 commits to master branch, last one about a year ago
Open Source Data Quality Monitoring.
Created
2023-07-15
128 commits to main branch, last one 4 months ago
ODD Specification is a universal open standard for collecting metadata.
Created
2020-12-16
187 commits to main branch, last one about a month ago
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created
2024-01-15
354 commits to main branch, last one 6 days ago
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived
(exclude archived)
Created
2017-10-09
397 commits to main branch, last one about a year ago
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created
2023-06-30
1,164 commits to master branch, last one 6 days ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created
2020-05-26
30 commits to main branch, last one about a year ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Created
2023-03-31
215 commits to main branch, last one 4 months ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created
2021-03-16
23 commits to main branch, last one 2 years ago
Data Quality Gate based on AWS
Created
2022-05-31
342 commits to main branch, last one 8 months ago
Data catalog for everything in your company
This repository has been archived
(exclude archived)
Created
2021-10-22
134 commits to master branch, last one about a year ago
Open-source metadata collector based on ODD Specification
This repository has been archived
(exclude archived)
Created
2022-02-10
259 commits to main branch, last one 7 months ago
Data-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Created
2020-10-21
46 commits to master branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created
2021-01-05
76 commits to main branch, last one 5 months ago
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Created
2023-10-18
609 commits to alpha branch, last one 6 days ago
Guide to data platforms and tools
Created
2022-03-09
2 commits to main branch, last one 2 years ago