28 results found

913
4.7k
apache-2.0
46
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created 2021-08-01
9,664 commits to main branch, last one 15 hours ago
191
1.8k
apache-2.0
12
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created 2020-12-14
696 commits to main branch, last one 7 hours ago
152
1.8k
apache-2.0
9
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created 2021-08-30
4,878 commits to master branch, last one a day ago
294
1.7k
apache-2.0
47
Collect, aggregate, and visualize a data ecosystem's metadata
Created 2018-07-05
2,728 commits to main branch, last one 2 days ago
215
1.2k
mit
21
SQL Lineage Analysis Tool powered by Python
Created 2019-05-21
375 commits to master branch, last one about a month ago
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Created 2021-07-07
818 commits to main branch, last one 21 hours ago
258
771
apache-2.0
37
Egeria core
Created 2018-05-31
20,665 commits to main branch, last one a day ago
19
306
apache-2.0
11
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created 2023-05-13
141 commits to main branch, last one a day ago
Generate and Visualize Data Lineage from query history
Created 2020-03-17
100 commits to master branch, last one 10 months ago
34
164
unknown
5
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Created 2021-11-12
488 commits to main branch, last one 15 hours ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Created 2020-06-09
57 commits to master branch, last one 6 months ago
71
135
apache-2.0
2
This Apache Atlas image is built from the latest release source tarball and patched to run in a Docker container.
Created 2019-05-24
102 commits to master branch, last one about a year ago
ODD Specification is a universal open standard for collecting metadata.
Created 2020-12-16
187 commits to main branch, last one about a month ago
19
113
mit
7
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created 2024-01-15
354 commits to main branch, last one 6 days ago
28
100
apache-2.0
34
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived
Created 2017-10-09
397 commits to main branch, last one about a year ago
20
88
apache-2.0
14
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created 2023-06-30
1,164 commits to master branch, last one 6 days ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one about a year ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy; don't use it in production.
Created 2023-03-31
215 commits to main branch, last one 4 months ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created 2021-03-16
23 commits to main branch, last one 2 years ago
Data Quality Gate based on AWS
Created 2022-05-31
342 commits to main branch, last one 8 months ago
13
48
apache-2.0
8
Data catalog for everything in your company
This repository has been archived
Created 2021-10-22
134 commits to master branch, last one about a year ago
Open-source metadata collector based on ODD Specification
This repository has been archived
Created 2022-02-10
259 commits to main branch, last one 7 months ago
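Data-Export supports exporting on-chain data to storage media such as MySQL and ES that are convenient for big-data processing, solving the problems of complex querying, analysis, visualization, and processing of blockchain data.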
Created 2020-10-21
46 commits to master branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created 2021-01-05
76 commits to main branch, last one 5 months ago
1
31
apache-2.0
3
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Created 2023-10-18
609 commits to alpha branch, last one 6 days ago
Guide to data platforms and tools
Created 2022-03-09
2 commits to main branch, last one 2 years ago