33 results found Sort:

3.1k
10.5k
apache-2.0
255
The Metadata Platform for your Data and AI Stack
Created 2015-11-18
11,618 commits to master branch, last one 11 hours ago
1.2k
6.4k
apache-2.0
49
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created 2021-08-01
12,297 commits to main branch, last one 19 hours ago
230
2.1k
apache-2.0
15
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created 2020-12-14
788 commits to main branch, last one 10 days ago
182
2.0k
apache-2.0
12
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created 2021-08-30
5,286 commits to master branch, last one 4 days ago
340
1.9k
apache-2.0
46
Collect, aggregate, and visualize a data ecosystem's metadata
Created 2018-07-05
2,856 commits to main branch, last one 16 days ago
247
1.4k
mit
22
SQL Lineage Analysis Tool powered by Python
Created 2019-05-21
396 commits to master branch, last one 28 days ago
122
1.3k
apache-2.0
18
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Created 2021-07-07
827 commits to main branch, last one about a month ago
259
838
apache-2.0
37
Egeria core
Created 2018-05-31
21,067 commits to main branch, last one 11 days ago
35
466
apache-2.0
18
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created 2023-05-13
276 commits to main branch, last one about a month ago
Generate and Visualize Data Lineage from query history
Created 2020-03-17
100 commits to master branch, last one about a year ago
79
247
unknown
15
Main repo including core data model, data marts, data quality tests, and terminology sets.
Created 2021-11-12
937 commits to main branch, last one a day ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
This repository has been archived (exclude archived)
Created 2020-06-09
57 commits to master branch, last one about a year ago
48
141
mit
4
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created 2024-01-15
492 commits to main branch, last one about a month ago
73
141
apache-2.0
1
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
Created 2019-05-24
102 commits to master branch, last one 2 years ago
ODD Specification is a universal open standard for collecting metadata.
Created 2020-12-16
188 commits to main branch, last one 5 months ago
31
122
apache-2.0
18
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created 2023-06-30
2,072 commits to master branch, last one a day ago
29
104
apache-2.0
32
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived (exclude archived)
Created 2017-10-09
397 commits to main branch, last one 2 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one 2 years ago
0
64
apache-2.0
2
三足乌数据中台融合数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、BI可视化、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,助力企业数字化转型。
Created 2024-09-04
29 commits to master branch, last one about a month ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Created 2023-03-31
215 commits to main branch, last one about a year ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created 2021-03-16
23 commits to main branch, last one 3 years ago
Data Quality Gate based on AWS
Created 2022-05-31
343 commits to main branch, last one 9 months ago
13
50
apache-2.0
7
Data catalog for everything in your company
This repository has been archived (exclude archived)
Created 2021-10-22
134 commits to master branch, last one about a year ago
System Design, Solution Architecture, Data Systems Practice
Created 2022-07-31
133 commits to master branch, last one 7 days ago
Data-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Created 2020-10-21
46 commits to master branch, last one 2 years ago
Open-source metadata collector based on ODD Specification
This repository has been archived (exclude archived)
Created 2022-02-10
259 commits to main branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created 2021-01-05
76 commits to main branch, last one about a year ago
A demo of Bufstream, a drop-in replacement for Apache Kafka that's 8x less expensive to operate and brings broker-side schema awareness to Kafka
Created 2024-07-02
28 commits to main branch, last one 11 days ago