Trending repositories for topic data-warehouse
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A curated list of awesome big data frameworks, ressources and other awesomeness.
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A curated list of awesome big data frameworks, ressources and other awesomeness.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
Privacy and Security focused Segment-alternative, in Golang and React
A curated list of awesome big data frameworks, ressources and other awesomeness.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Privacy and Security focused Segment-alternative, in Golang and React
A curated list of awesome big data frameworks, ressources and other awesomeness.
Building a modern data warehouse with SQL server, including ETL processes, data modeling, and analytics.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
A curated list of awesome big data frameworks, ressources and other awesomeness.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
Building a modern data warehouse with SQL server, including ETL processes, data modeling, and analytics.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Open source SQL Query Assistant service for Databases/Warehouses
Privacy and Security focused Segment-alternative, in Golang and React
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using dbt and finally , a dashboard to visualize the data using looker ...
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, e...
Open source SQL Query Assistant service for Databases/Warehouses
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Main repo including core data model, data marts, data quality tests, and terminology sets.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
One framework to develop, deploy and operate data workflows with Python and SQL.
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, e...
三足乌数据中台融合数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、BI可视化、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,助力企业数字化转型。
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
Personal project for setting up an open source data warehouse.
Building a modern data warehouse with SQL server, including ETL processes, data modeling, and analytics.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A curated list of awesome big data frameworks, ressources and other awesomeness.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Privacy and Security focused Segment-alternative, in Golang and React
Open source SQL Query Assistant service for Databases/Warehouses
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, e...
Main repo including core data model, data marts, data quality tests, and terminology sets.
三足乌数据中台融合数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、BI可视化、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,助力企业数字化转型。
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
🤖 The Semantic Engine for Model Context Protocol(MCP) Clients and AI Agents 🔥
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Main repo including core data model, data marts, data quality tests, and terminology sets.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
This is a template you can use for your next data engineering portfolio project.
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.