Trending repositories for topic data-warehouse
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A curated list of awesome big data frameworks, ressources and other awesomeness.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
A curated list of awesome big data frameworks, ressources and other awesomeness.
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Privacy and Security focused Segment-alternative, in Golang and React
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
A curated list of open source tools used in analytical stacks and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Open source SQL Query Assistant service for Databases/Warehouses
Privacy and Security focused Segment-alternative, in Golang and React
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Privacy and Security focused Segment-alternative, in Golang and React
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
A curated list of open source tools used in analytical stacks and data engineering ecosystem
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
A curated list of awesome big data frameworks, ressources and other awesomeness.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Privacy and Security focused Segment-alternative, in Golang and React
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Unified storage framework for the entire machine learning lifecycle
A curated list of open source tools used in analytical stacks and data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
A curated list of open source tools used in analytical stacks and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
This is a template you can use for your next data engineering portfolio project.
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Define, govern, and model event data for warehouse-first product analytics.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Data Engineer with Python lecture notes from #datacamp.
Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0