Statistics for topic data-analysis
RepositoryStats tracks 595,858 Github repositories, of these 669 are tagged with the data-analysis topic. The most common primary language for repositories using this topic is Python (205). Other languages include: Jupyter Notebook (177), TypeScript (31), C++ (27), JavaScript (27), HTML (23), Java (22), R (18), Go (13)
Stargazers over time for topic data-analysis
Most starred repositories for topic data-analysis (view more)
Trending repositories for topic data-analysis (view more)
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
This is a repo with links to everything you'd ever want to learn about data science
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
This is a repo with links to everything you'd ever want to learn about data science
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
This is a repo with links to everything you'd ever want to learn about data science
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
This is a repo with links to everything you'd ever want to learn about data science
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Apache Superset is a Data Visualization and Data Exploration Platform
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
This is a repo with links to everything you'd ever want to learn about data science
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
Streamlit — A faster way to build and share data apps.
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
This is a repo with links to everything you'd ever want to learn about data science
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.