31 results found Sort:

3.0k
10.0k
apache-2.0
254
The Metadata Platform for your Data and AI Stack
Created 2015-11-18
10,756 commits to master branch, last one 3 hours ago
1.1k
5.7k
apache-2.0
48
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created 2021-08-01
11,067 commits to main branch, last one 2 hours ago
165
1.9k
apache-2.0
12
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created 2021-08-30
5,068 commits to master branch, last one 14 days ago
211
1.9k
apache-2.0
12
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created 2020-12-14
764 commits to main branch, last one a day ago
321
1.8k
apache-2.0
47
Collect, aggregate, and visualize a data ecosystem's metadata
Created 2018-07-05
2,830 commits to main branch, last one 11 days ago
241
1.3k
mit
22
SQL Lineage Analysis Tool powered by Python
Created 2019-05-21
375 commits to master branch, last one 6 months ago
101
1.2k
apache-2.0
19
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Created 2021-07-07
825 commits to main branch, last one 2 months ago
261
812
apache-2.0
38
Egeria core
Created 2018-05-31
20,935 commits to main branch, last one 3 days ago
31
434
apache-2.0
17
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created 2023-05-13
265 commits to main branch, last one 5 days ago
Generate and Visualize Data Lineage from query history
Created 2020-03-17
100 commits to master branch, last one about a year ago
51
193
unknown
6
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Created 2021-11-12
753 commits to main branch, last one 5 days ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
This repository has been archived (exclude archived)
Created 2020-06-09
57 commits to master branch, last one 12 months ago
71
140
apache-2.0
2
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
Created 2019-05-24
102 commits to master branch, last one about a year ago
46
140
mit
5
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created 2024-01-15
488 commits to main branch, last one 18 days ago
ODD Specification is a universal open standard for collecting metadata.
Created 2020-12-16
188 commits to main branch, last one about a month ago
26
113
apache-2.0
17
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created 2023-06-30
1,524 commits to master branch, last one 10 hours ago
28
101
apache-2.0
34
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived (exclude archived)
Created 2017-10-09
397 commits to main branch, last one 2 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one about a year ago
0
63
apache-2.0
2
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
Created 2024-09-04
27 commits to master branch, last one 2 months ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created 2021-03-16
23 commits to main branch, last one 3 years ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Created 2023-03-31
215 commits to main branch, last one 9 months ago
Data Quality Gate based on AWS
Created 2022-05-31
343 commits to main branch, last one 4 months ago
13
51
apache-2.0
8
Data catalog for everything in your company
This repository has been archived (exclude archived)
Created 2021-10-22
134 commits to master branch, last one about a year ago
Data-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Created 2020-10-21
46 commits to master branch, last one about a year ago
Open-source metadata collector based on ODD Specification
This repository has been archived (exclude archived)
Created 2022-02-10
259 commits to main branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created 2021-01-05
76 commits to main branch, last one 10 months ago
1
34
apache-2.0
4
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Created 2023-10-18
624 commits to alpha branch, last one 3 months ago
Guide to data platforms and tools
Created 2022-03-09
2 commits to main branch, last one 2 years ago