33 results found
- Filter by Primary Language:
- Python (11)
- Java (9)
- Shell (2)
- HTML (2)
- Kotlin (1)
- Go (1)
- TypeScript (1)
The Metadata Platform for your Data and AI Stack
Created
2015-11-18
11,618 commits to master branch, last one 11 hours ago
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created
2021-08-01
12,297 commits to main branch, last one 19 hours ago
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created
2020-12-14
788 commits to main branch, last one 10 days ago
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created
2021-08-30
5,286 commits to master branch, last one 4 days ago
Collect, aggregate, and visualize a data ecosystem's metadata
Created
2018-07-05
2,856 commits to main branch, last one 16 days ago
SQL Lineage Analysis Tool powered by Python
Created
2019-05-21
396 commits to master branch, last one 28 days ago
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Created
2021-07-07
827 commits to main branch, last one about a month ago
Egeria core
Created
2018-05-31
21,067 commits to main branch, last one 11 days ago
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created
2023-05-13
276 commits to main branch, last one about a month ago
Metrics Observability & Troubleshooting
Created
2023-04-21
404 commits to main branch, last one about a year ago
Generate and Visualize Data Lineage from query history
Created
2020-03-17
100 commits to master branch, last one about a year ago
Main repo including core data model, data marts, data quality tests, and terminology sets.
Created
2021-11-12
937 commits to main branch, last one a day ago
Open Source Data Quality Monitoring.
Created
2023-07-15
193 commits to main branch, last one 2 days ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
This repository has been archived
Created
2020-06-09
57 commits to master branch, last one about a year ago
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created
2024-01-15
492 commits to main branch, last one about a month ago
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
Created
2019-05-24
102 commits to master branch, last one 2 years ago
ODD Specification is a universal open standard for collecting metadata.
Created
2020-12-16
188 commits to main branch, last one 5 months ago
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created
2023-06-30
2,072 commits to master branch, last one a day ago
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived
Created
2017-10-09
397 commits to main branch, last one 2 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created
2020-05-26
30 commits to main branch, last one 2 years ago
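The 三足乌 data middle platform integrates data ingestion, data development, data warehousing, data governance, data assets, data services, BI visualization, system administration, and other functional modules into a single product. It breaks down data barriers, solves the data-silo problem, and supports enterprise digital transformation.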
Created
2024-09-04
29 commits to master branch, last one about a month ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Created
2023-03-31
215 commits to main branch, last one about a year ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created
2021-03-16
23 commits to main branch, last one 3 years ago
Data Quality Gate based on AWS
Created
2022-05-31
343 commits to main branch, last one 9 months ago
Data catalog for everything in your company
This repository has been archived
Created
2021-10-22
134 commits to master branch, last one about a year ago
System Design, Solution Architecture, Data Systems Practice
Created
2022-07-31
133 commits to master branch, last one 7 days ago
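Data-Export supports exporting on-chain data to storage media such as MySQL and ES that are convenient for big-data processing, addressing the problems of complex querying, analysis, visualization, and processing of blockchain data.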
Created
2020-10-21
46 commits to master branch, last one 2 years ago
Open-source metadata collector based on ODD Specification
This repository has been archived
Created
2022-02-10
259 commits to main branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created
2021-01-05
76 commits to main branch, last one about a year ago
A demo of Bufstream, a drop-in replacement for Apache Kafka that's 8x less expensive to operate and brings broker-side schema awareness to Kafka
Created
2024-07-02
28 commits to main branch, last one 11 days ago