21 results found

3.0k
10.0k
apache-2.0
254
The Metadata Platform for your Data and AI Stack
Created 2015-11-18
10,756 commits to master branch, last one 3 hours ago
1.1k
5.7k
apache-2.0
48
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created 2021-08-01
11,067 commits to main branch, last one 2 hours ago
960
4.5k
apache-2.0
231
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Created 2019-05-14
2,705 commits to main branch, last one about a month ago
101
1.2k
apache-2.0
19
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Created 2021-07-07
825 commits to main branch, last one 2 months ago
352
1.1k
apache-2.0
29
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Created 2023-04-23
1,990 commits to main branch, last one 5 hours ago
141
1.0k
bsd-2-clause
42
Intake is a lightweight package for finding, investigating, loading and disseminating data.
Created 2017-08-14
2,268 commits to master branch, last one 19 days ago
📙 Awesome Data Catalogs and Observability Platforms.
Created 2021-07-14
91 commits to main branch, last one 4 months ago
38
725
gpl-3.0
41
🐳 The stupidly simple CLI workspace for your data warehouse.
Created 2020-05-27
354 commits to master branch, last one 2 years ago
24
334
mit
10
Work with your web service, database, and streaming schemas in a single format.
Created 2022-12-07
339 commits to main branch, last one 9 months ago
96
282
apache-2.0
13
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Created 2019-03-21
280 commits to master branch, last one about a year ago
40
192
apache-2.0
8
Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.
Created 2021-03-22
375 commits to main branch, last one about a month ago
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
This repository has been archived
Created 2020-06-09
57 commits to master branch, last one 12 months ago
47
140
apache-2.0
16
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Created 2018-12-31
1,157 commits to main branch, last one 22 hours ago
5
78
bsd-2-clause
6
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Created 2020-06-10
201 commits to master branch, last one 2 years ago
10
65
apache-2.0
6
End-to-end DataOps platform deployed by Terraform.
Created 2022-04-18
14 commits to main branch, last one about a year ago
13
51
apache-2.0
8
Data catalog for everything in your company
This repository has been archived
Created 2021-10-22
134 commits to master branch, last one about a year ago
Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Catalog and Dataplex. Tag Engine is licensed under the Apache 2 li...
Created 2021-01-11
594 commits to cloud-run branch, last one about a month ago
The documentation repository is part of the Corporate Linked Data Catalog - short: COLID - application.
Created 2020-07-07
26 commits to master branch, last one about a year ago
Open-source metadata collector based on ODD Specification
This repository has been archived
Created 2022-02-10
259 commits to main branch, last one about a year ago
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
Created 2023-03-25
315 commits to main branch, last one 10 days ago
1
34
apache-2.0
4
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Created 2023-10-18
624 commits to alpha branch, last one 3 months ago