Trending repositories for topic dbt
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Apache Doris is an easy-to-use, high performance and unified analytics database.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Efficient data transformation and modeling framework that is backwards compatible with dbt.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
The data-validation toolkit for enhanced dbt (data build tool) PR review
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
The data-validation toolkit for enhanced dbt (data build tool) PR review
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Port(ish) of Great Expectations to dbt test macros
Efficient data transformation and modeling framework that is backwards compatible with dbt.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Apache Doris is an easy-to-use, high performance and unified analytics database.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Apache Doris is an easy-to-use, high performance and unified analytics database.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
The data-validation toolkit for enhanced dbt (data build tool) PR review
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
The data-validation toolkit for enhanced dbt (data build tool) PR review
A curated list of awesome public DBT projects
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts
Efficient data transformation and modeling framework that is backwards compatible with dbt.
:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Apache Doris is an easy-to-use, high performance and unified analytics database.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
The data-validation toolkit for enhanced dbt (data build tool) PR review
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a music streaming platform, let’s delve into the detailed workfl...
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
The data-validation toolkit for enhanced dbt (data build tool) PR review
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
A curated list of awesome public DBT projects
Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
A dbt-core plugin to weave together multi-project dbt-core deployments
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a music streaming platform, let’s delve into the detailed workfl...
Apache Doris is an easy-to-use, high performance and unified analytics database.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
The data-validation toolkit for enhanced dbt (data build tool) PR review
Port(ish) of Great Expectations to dbt test macros
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
🎨 UI for the Free Data Engineering Zoomcamp Course provided by DataTalksClub
The data-validation toolkit for enhanced dbt (data build tool) PR review
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
A Python package that creates fine-grained dbt tasks on Apache Airflow
A dbt-core plugin to weave together multi-project dbt-core deployments
DBT Package reproducing dbt incremental materialization leveraging on Snowflake streams
Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution for valuable insights.
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
🎨 UI for the Free Data Engineering Zoomcamp Course provided by DataTalksClub
A curated list of awesome public DBT projects
This repo demonstrate a comprehensive modern data stack using popular open-source tools.
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
🌲 Open source, serverless, and local-first data hub for Gitcoin Grants data!
Showcase of advanced use cases relating to CI in dbt
The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt
This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub