Trending repositories for topic data-analysis
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
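An artificial intelligence learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying course materials; beginner-friendly and geared toward job-ready practice! Covers: Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...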
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
An advanced and mature open-source MPP (Massively Parallel Processing) database. Open-source alternative to Greenplum Database.
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Roadmap to becoming an Artificial Intelligence Expert in 2022
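"Data visualization tool: reports, big screens, dashboards." 积木报表 (JimuReport) is a reporting tool and data visualization product with an Excel-like operating style and online drag-and-drop design. Features include report design, big-screen design, print design, chart reports, and dashboard portal design, all completely free! Guided by the product philosophy of "simple, easy to use, professional", it greatly reduces the difficulty of report development, shortens development cycles, and solves all kinds of reporting problems.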
This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.
Quantitative Investment Strategies (QIS) package implements Python analytics for visualisation of financial data, performance reporting, analysis of quantitative strategies.
Using a combination of Excel, SQL, and Tableau, I delved into the extensive datasets comprising over 82k rows of data from Netflix's shows and movies library. Through data simplification and analysis,...
A framework for rapid data (JSON) analysis, shareable serverless reports and dashboards
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
Powerful Analytics Solution. Setup in 30 seconds. Display all your data on a Simple, AI-powered dashboard. Fully self-hostable and GDPR compliant.
Stack Overflow is a professional community for developers. This repo analyzes 3 years of the developer survey conducted by Stack Overflow, visualizes the results, and predicts the salary of data scientists in future...
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Notes for Deep Learning Specialization Courses led by Andrew Ng.
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
A project providing a Graphic Walker Pane for use with HoloViz Panel.
Revolutionize the way we interact with SQL databases using Generative AI
Code and Data for the Second Edition of "Practical SQL" by Anthony DeBarros, published by No Starch Press (2022).
Export Spotify playlists using the Web API. Analyze them in the Jupyter notebook.
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Practices on data analysis including: cleaning, visualization and EDA on different datasets using Python, SQL, Power BI, etc.
A curated list of great blockchain or crypto-based research tools, products, and services, that allows you to research the market in a way like never before!❤️🔥
This repository contains resources in the form of ebooks, which are related to Data Science, Machine Learning, and similar topics.
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
This is a repo with links to everything you'd ever want to learn about data science
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Visual Data Transformation with Python Code Generation. Low-Code Python-based ETL.
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
PSDuckDB is a PowerShell module that provides seamless integration with DuckDB, enabling efficient execution of analytical SQL queries directly from the PowerShell environment.
Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whether you're creating customer-facing chatbots, internal Slack bots...
A Data analysis agent powered by llm for querying database and visualizing results
Streamlit — A faster way to build and share data apps.
10 Weeks, 20 Lessons, Data Science for All!
A code-first agent framework for seamlessly planning and executing data analytics tasks.
🙌 Welcome open-source Python mini-project contributions!
Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes.
PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like Spark Introduction, Spark Installation, Spark RDD Transformati...
This repository is to show my Data Analytics & Engineering skills, share projects, and track my progress.
This repository was created to showcase my analytical and technical skills (Excel, Python, SQL, Power BI, PowerPoint, and others).
Get started with SQL database programming. This beginner's guide provides step-by-step tutorials, practical examples, exercises, and resources to master SQL. Let's unlock the power of data with SQL!