Trending repositories for topic data-warehouse
Cloudberry Database - Next generation unified database for Analytics and AI
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
A curated list of awesome big data frameworks, ressources and other awesomeness.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Privacy and Security focused Segment-alternative, in Golang and React
Cloudberry Database - Next generation unified database for Analytics and AI
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Privacy and Security focused Segment-alternative, in Golang and React
A curated list of awesome big data frameworks, ressources and other awesomeness.
Cloudberry Database - Next generation unified database for Analytics and AI
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Privacy and Security focused Segment-alternative, in Golang and React
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
TensorBase is a new big data warehousing with modern efforts.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
This is a template you can use for your next data engineering portfolio project.
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
Cloudberry Database - Next generation unified database for Analytics and AI
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
This is a template you can use for your next data engineering portfolio project.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
TensorBase is a new big data warehousing with modern efforts.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Privacy and Security focused Segment-alternative, in Golang and React
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
A curated list of awesome big data frameworks, ressources and other awesomeness.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
Cloudberry Database - Next generation unified database for Analytics and AI
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
Privacy and Security focused Segment-alternative, in Golang and React
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Open source SQL Query Assistant service for Databases/Warehouses
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Wren Engine is the backbone of the semantic layer - The semantic engine for LLMs, bringing business context to AI agents.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Cloudberry Database - Next generation unified database for Analytics and AI
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
This is a template you can use for your next data engineering portfolio project.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
A curated list of awesome big data frameworks, ressources and other awesomeness.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Privacy and Security focused Segment-alternative, in Golang and React
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Cloudberry Database - Next generation unified database for Analytics and AI
Unified storage framework for the entire machine learning lifecycle
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
This is a template you can use for your next data engineering portfolio project.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake...
A curated list of open source tools used in analytical stacks and data engineering ecosystem
This is a template you can use for your next data engineering portfolio project.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Define, govern, and model event data for warehouse-first product analytics.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using dbt and finally , a dashboard to visualize the data using looker ...
Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0
Data Engineer with Python lecture notes from #datacamp.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open source SQL Query Assistant service for Databases/Warehouses
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi