Trending repositories for topic data-warehouse
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
An advanced and mature open-source MPP (Massively Parallel Processing) database. Open-source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, resources and other awesomeness.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Privacy and Security focused Segment-alternative, in Golang and React
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
A curated list of open source tools used in analytics platforms and the data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
TensorBase is a new big data warehouse built with modern efforts.
Open source SQL Query Assistant service for Databases/Warehouses
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.
This is a template you can use for your next data engineering portfolio project.
Use sample content to explore SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Unified storage framework for the entire machine learning lifecycle
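三足乌 (Sanzuwu) Data Middle Platform integrates data planning, data ingestion, data development, data warehousing, data governance, data assets, data services, data operations, and system management modules into one. It breaks down data barriers, solves the data silo problem, enables low-code visual data development, and supports the digital transformation of governments and enterprises.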
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Data Engineer with Python lecture notes from #datacamp.
Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0