Trending repositories for topic data-warehouse
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
Privacy and Security focused Segment-alternative, in Golang and React
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Open source SQL Query Assistant service for Databases/Warehouses
A curated list of awesome big data frameworks, ressources and other awesomeness.
Privacy and Security focused Segment-alternative, in Golang and React
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Privacy and Security focused Segment-alternative, in Golang and React
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
A curated list of open source tools used in analytics platforms and data engineering ecosystem
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
:whale: Tool to automate data quality checks on data pipelines
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Open source SQL Query Assistant service for Databases/Warehouses
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Privacy and Security focused Segment-alternative, in Golang and React
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Privacy and Security focused Segment-alternative, in Golang and React
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allo...
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Unified storage framework for the entire machine learning lifecycle
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
A curated list of awesome big data frameworks, ressources and other awesomeness.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Privacy and Security focused Segment-alternative, in Golang and React
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Unified storage framework for the entire machine learning lifecycle
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
A curated list of open source tools used in analytics platforms and data engineering ecosystem
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and att...
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
This is a template you can use for your next data engineering portfolio project.
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Data Engineer with Python lecture notes from #datacamp.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.