31 results found Sort:
- Filter by Primary Language:
- Python (12)
- Java (9)
- HTML (2)
- TypeScript (1)
- Kotlin (1)
- Go (1)
- Shell (1)
- +
The Metadata Platform for your Data and AI Stack
Created
2015-11-18
10,756 commits to master branch, last one 3 hours ago
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created
2021-08-01
11,067 commits to main branch, last one 2 hours ago
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Created
2021-08-30
5,068 commits to master branch, last one 14 days ago
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Created
2020-12-14
764 commits to main branch, last one a day ago
Collect, aggregate, and visualize a data ecosystem's metadata
Created
2018-07-05
2,830 commits to main branch, last one 11 days ago
SQL Lineage Analysis Tool powered by Python
Created
2019-05-21
375 commits to master branch, last one 6 months ago
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Created
2021-07-07
825 commits to main branch, last one 2 months ago
Egeria core
Created
2018-05-31
20,935 commits to main branch, last one 3 days ago
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Created
2023-05-13
265 commits to main branch, last one 5 days ago
Metrics Observability & Troubleshooting
Created
2023-04-21
404 commits to main branch, last one 9 months ago
Generate and Visualize Data Lineage from query history
Created
2020-03-17
100 commits to master branch, last one about a year ago
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Created
2021-11-12
753 commits to main branch, last one 5 days ago
Open Source Data Quality Monitoring.
Created
2023-07-15
174 commits to main branch, last one 18 days ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
This repository has been archived
(exclude archived)
Created
2020-06-09
57 commits to master branch, last one 12 months ago
This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.
Created
2019-05-24
102 commits to master branch, last one about a year ago
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
Created
2024-01-15
488 commits to main branch, last one 18 days ago
ODD Specification is a universal open standard for collecting metadata.
Created
2020-12-16
188 commits to main branch, last one about a month ago
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a u...
Created
2023-06-30
1,524 commits to master branch, last one 10 hours ago
Egeria's Guidance on Governance as well as large media files such as presentations and movies
This repository has been archived
(exclude archived)
Created
2017-10-09
397 commits to main branch, last one 2 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created
2020-05-26
30 commits to main branch, last one about a year ago
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
Created
2024-09-04
27 commits to master branch, last one 2 months ago
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Created
2021-03-16
23 commits to main branch, last one 3 years ago
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Created
2023-03-31
215 commits to main branch, last one 9 months ago
Data Quality Gate based on AWS
Created
2022-05-31
343 commits to main branch, last one 4 months ago
Data catalog for everything in your company
This repository has been archived
(exclude archived)
Created
2021-10-22
134 commits to master branch, last one about a year ago
Data-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Created
2020-10-21
46 commits to master branch, last one about a year ago
Open-source metadata collector based on ODD Specification
This repository has been archived
(exclude archived)
Created
2022-02-10
259 commits to main branch, last one about a year ago
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Created
2021-01-05
76 commits to main branch, last one 10 months ago
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Created
2023-10-18
624 commits to alpha branch, last one 3 months ago
Guide to data platforms and tools
Created
2022-03-09
2 commits to main branch, last one 2 years ago