Trending repositories for topic dbt
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Apache Doris is an easy-to-use, high performance and unified analytics database.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!
This extension makes VS Code work seamlessly with dbt™: auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation, etc.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Query Snowflake tables locally with DuckDB, without any need for a running warehouse
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset
This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
Low-cost CRM architecture with Gen AI, designed for startups that need to process and analyze sales data efficiently.
Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.
A dbt package from SELECT to help you monitor Snowflake performance and costs
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Supplementary Materials for The Complete dbt (Data Build Tool) Bootcamp Udemy course
The data-validation toolkit for enhanced dbt (data build tool) PR review
To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a music streaming platform, let’s delve into the detailed workfl...
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset
Datapilot-cli is the command-line interface for accessing the AI teammate that helps engineers follow best practices in their SQL and dbt projects.
Port(ish) of Great Expectations to dbt test macros
🎨 UI for the Free Data Engineering Zoomcamp Course provided by DataTalksClub
A Python package that creates fine-grained dbt tasks on Apache Airflow
A dbt-core plugin to weave together multi-project dbt-core deployments
Discover the simplicity and strength of DuckDB, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution for valuable insights.
This repo demonstrates a comprehensive modern data stack using popular open-source tools.
dbt package that reproduces dbt incremental materialization using Snowflake streams
A curated list of awesome public DBT projects
Showcase of advanced use cases relating to CI in dbt
The dbt of ML: Aligned describes data dependencies in ML systems and reduces technical data debt
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub