Trending repositories for topic bigquery
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Apache Doris is an easy-to-use, high performance and unified analytics database.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
HTTP Archive's annual "State of the Web" report made by the web community
Privacy and Security focused Segment-alternative, in Golang and React
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
HTTP Archive's annual "State of the Web" report made by the web community
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Apache Doris is an easy-to-use, high performance and unified analytics database.
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Apache Doris is an easy-to-use, high performance and unified analytics database.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Privacy and Security focused Segment-alternative, in Golang and React
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Data Foundation - Google Cloud Cortex Framework
Apache Doris is an easy-to-use, high performance and unified analytics database.
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
HTTP Archive's annual "State of the Web" report made by the web community
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An end to end demo of Google's Cloud data and analytic stack.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Apache Doris is an easy-to-use, high performance and unified analytics database.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.
The open source high performance ELT framework powered by Apache Arrow
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Solution for documenting & administration Events, Parameters & Annotations for Google Analytics 4 (GA4) using Google Sheet, BigQuery & Looker Studio.
This Repo contain details related to Data Engineering tech stacks in GCP
A free, open-source, web-based self-service BI tailor-made for clickhouse, google bigquery, mysql, postgresql, vertica
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
An end to end demo of Google's Cloud data and analytic stack.
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Python utilities for working with FHIR, including libraries to build simple, flat FHIR views in BigQuery.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
Solution for documenting & administration Events, Parameters & Annotations for Google Analytics 4 (GA4) using Google Sheet, BigQuery & Looker Studio.
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
Apache Doris is an easy-to-use, high performance and unified analytics database.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🚀 An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. 🤘
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.
The open source high performance ELT framework powered by Apache Arrow
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
dataform-ga4-sessions is a Dataform package to prepare session and event tables from Google Analytics 4 (GA4) BigQuery raw data
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
This solution provides an automated, serverless way to redact sensitive data from PDF files using Google Cloud Services like Data Loss Prevention (DLP), Cloud Workflows, and Cloud Run.
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large lang...
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
A DuckDB extension to read data directly from databases supporting the ODBC interface