Trending repositories for topic data-science
This is a repo with links to everything you'd ever want to learn about data science
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Apache Superset is a Data Visualization and Data Exploration Platform
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
This is a repo with links to everything you'd ever want to learn about data science
Cross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Fast and customizable framework for automatic and quick Causal Inference in Python
🔥🔥🔥 Latest Advances on Large Recommendation Models
Explore a collection of end-to-end data analytics projects showcasing SQL, Python, and Power BI. Gain valuable insights and solutions to real-world problems through data extraction, analysis, and visu...
⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,日志记录与可视化AI训练全流程。
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
This is a repo with links to everything you'd ever want to learn about data science
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Apache Superset is a Data Visualization and Data Exploration Platform
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
:memo: An awesome Data Science repository to learn and apply for real world problems.
This is a repo with links to everything you'd ever want to learn about data science
Machine Learning Mischief: Examples from the dark side of data science
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
🔥🔥🔥 Latest Advances on Large Recommendation Models
Cross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Explore a collection of end-to-end data analytics projects showcasing SQL, Python, and Power BI. Gain valuable insights and solutions to real-world problems through data extraction, analysis, and visu...
Bayesian Neural Field models for prediction in large-scale spatiotemporal datasets
Repository for CARTE: Context-Aware Representation of Table Entries
🟣 Pandas interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
🔥🔥🔥 Latest Advances on Large Recommendation Models
Machine Learning Mischief: Examples from the dark side of data science
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Superset is a Data Visualization and Data Exploration Platform
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Python training for business analysts and traders
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
An orchestration platform for the development, production, and observation of data assets.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
This is a repo with links to everything you'd ever want to learn about data science
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Explore a collection of resources and projects in Computer Science, covering algorithms, data structures, programming languages, and emerging technologies. Ideal for learners and enthusiasts looking t...
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
Cross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
JavaScript client library for MLflow, providing functionalities for machine learning lifecycle
This repository contains an in-depth analysis of the Intrusion Detection Evaluation Dataset (CIC-IDS2017) for Intrusion Detection, showcasing the implementation and comparison of different machine lea...
This is a repo with links to everything you'd ever want to learn about data science
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Repository for CARTE: Context-Aware Representation of Table Entries
A Notebook Web Client with Flexible Customization and Easy Integration.
Explore a collection of end-to-end data analytics projects showcasing SQL, Python, and Power BI. Gain valuable insights and solutions to real-world problems through data extraction, analysis, and visu...
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
This is a repo with links to everything you'd ever want to learn about data science
Visual Data Transformation with Python Code Generation. Low-Code Python-based ETL.
AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
Plotlars is a Rust library designed to facilitate the integration between the Polars data analysis library and Plotly library.
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Apache Superset is a Data Visualization and Data Exploration Platform
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Streamlit — A faster way to build and share data apps.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
📺 Discover the latest machine learning / AI courses on YouTube.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Python training for business analysts and traders
10 Weeks, 20 Lessons, Data Science for All!
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
:memo: An awesome Data Science repository to learn and apply for real world problems.
2025 AI/ML internship & new graduate job list updated daily
⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,日志记录与可视化AI训练全流程。
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
A Notebook Web Client with Flexible Customization and Easy Integration.
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
This is a repo with links to everything you'd ever want to learn about data science
This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
The book every data scientist needs on their desk.
Welcome to the "100 Project Ideas for Full Stack Developers" repository. This project was created with the aim of providing a diverse and inspiring collection of project ideas for full-stack developer...
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
A Python script that anonymizes an Excel file and synthesizes new data in its place.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.