Trending repositories for topic data-warehouse
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Privacy and Security focused Segment-alternative, in Golang and React
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
TensorBase is a new big data warehousing with modern efforts.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
TensorBase is a new big data warehousing with modern efforts.
Privacy and Security focused Segment-alternative, in Golang and React
A curated list of awesome big data frameworks, ressources and other awesomeness.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Privacy and Security focused Segment-alternative, in Golang and React
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
A curated list of awesome big data frameworks, ressources and other awesomeness.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
This is a template you can use for your next data engineering portfolio project.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
TensorBase is a new big data warehousing with modern efforts.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
This is a template you can use for your next data engineering portfolio project.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Privacy and Security focused Segment-alternative, in Golang and React
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
TensorBase is a new big data warehousing with modern efforts.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A curated list of awesome big data frameworks, ressources and other awesomeness.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Privacy and Security focused Segment-alternative, in Golang and React
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
This is a template you can use for your next data engineering portfolio project.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Analytics - Open source data warehouse and reporting for Nextcloud
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
A curated list of awesome big data frameworks, ressources and other awesomeness.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Privacy and Security focused Segment-alternative, in Golang and React
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Unified storage framework for the entire machine learning lifecycle
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
A curated list of open source tools used in analytics platforms and data engineering ecosystem
三足乌数据中台融合数据规划、数据接入、数据开发、数据仓库、数据治理、数据资产、数据服务、数据运维、系统管理等功能模块为一体。打通数据壁垒,解决数据孤岛问题,实现数据的低代码可视化开发,助力政府、企业数字化转型。
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for ...
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if ap...
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
This is a template you can use for your next data engineering portfolio project.
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Data Engineer with Python lecture notes from #datacamp.
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.