Statistics for topic data-analytics
RepositoryStats tracks 650,729 GitHub repositories, of which 149 are tagged with the data-analytics topic. The most common primary language for repositories using this topic is Python (41). Other languages include Jupyter Notebook (28) and TypeScript (11).
Stargazers over time for topic data-analytics
Most starred repositories for topic data-analytics
Trending repositories for topic data-analytics
Apache Superset is a Data Visualization and Data Exploration Platform
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
An MCP (Model Context Protocol) server for interacting with dbt.
🤖 The Semantic Engine for Model Context Protocol (MCP) Clients and AI Agents 🔥
Apache Superset is a Data Visualization and Data Exploration Platform
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
An MCP (Model Context Protocol) server for interacting with dbt.
The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
An MCP (Model Context Protocol) server for interacting with dbt.
This repository contains a SQL dataset of a music store and SQL queries to answer questions about the data. The results of the SQL queries can be found in the analysis.sql file. This repository can b...
Apache Superset is a Data Visualization and Data Exploration Platform
🦀 Event stream processing for developers to collect and transform data in motion to power responsive, data-intensive applications.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
An MCP (Model Context Protocol) server for interacting with dbt.
An MCP (Model Context Protocol) server for interacting with dbt.
The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time analysis, cumulative analysis, performance analysis, data segmentation, and part-to-whole analysis.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digita...
A free roadmap to guide you through mastering SQL for Data Science in just 6 weeks.
MinusX is an AI Data Scientist for Analytics Apps you already use and love. Currently it supports Jupyter, Metabase, Google Sheets & Posthog.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
🦀 Event stream processing for developers to collect and transform data in motion to power responsive, data-intensive applications.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
MinusX is an AI Data Scientist for Analytics Apps you already use and love. Currently it supports Jupyter, Metabase, Google Sheets & Posthog.
ANJANA is a Python library for anonymizing sensitive data.