Statistics for topic data-analytics
RepositoryStats tracks 595,858 Github repositories, of these 133 are tagged with the data-analytics topic. The most common primary language for repositories using this topic is Python (33). Other languages include: Jupyter Notebook (25), TypeScript (12)
Stargazers over time for topic data-analytics
Most starred repositories for topic data-analytics (view more)
Trending repositories for topic data-analytics (view more)
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
A curated list of awesome big data frameworks, ressources and other awesomeness.
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
A curated list of open source tools used in analytics platforms and data engineering ecosystem
MinusX is an AI Data Scientist for Analytics Apps you already use and love. Currently it supports Jupyter, Metabase, Google Sheets & Posthog.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.
Discover a curated collection of dynamic Power BI dashboards covering financial analytics, HR metrics, streaming service trends, real estate dynamics, and more. Meticulously designed for comprehensive...
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
Apache Superset is a Data Visualization and Data Exploration Platform
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
MinusX is an AI Data Scientist for Analytics Apps you already use and love. Currently it supports Jupyter, Metabase, Google Sheets & Posthog.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.