Trending repositories for topic data-science
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Apache Superset is a Data Visualization and Data Exploration Platform
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
500 AI, machine learning, deep learning, computer vision, and NLP projects with code
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🐍 Data Analysis with the Pandas Library & Notes 📊📈
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, a...
Techniques and resources for studying data science.
A library of quantitative algorithms for algorithmic trading implemented with Python
Data science, machine learning books and resources
A tool for gender bias identification in text. Part of Microsoft's Responsible AI toolbox.
MLRun/Iguazio/Nuclio quality gate solution. The solution checks the quality of an MLRun implementation/delivery.
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
ML/AI meta-model, used in MLRun/Iguazio/Nuclio, see qgate-sln-<MLRun | solution>
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
⚡ Easy API access to the tabular foundation model TabPFN ⚡
Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗
Python library for implementing Responsible AI mitigations.
A collection of about 200 datasets. You can call it mini-kaggle :)
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Ultralytics / veRL / M...
Comprehensive guide to generative AI projects and resources in Julia.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Python training for business analysts and traders
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intel...
Cross Beat (xbe.at) - Your hub for python, machine learning and AI tutorials. Explore Python tutorials, AI insights, and more.
2025 AI/ML internship & new graduate job list updated daily
A curated list of 100+ resources for building and deploying generative AI specifically focusing on helping you become a Generative AI Data Scientist with LLMs
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.
This repository contains Data Science interview questions covered on my Threads page.
Computer science books from algorithms, data structure, programming, to data science, AI and much more.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
This is a repo with links to everything you'd ever want to learn about data science
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.
Plotlars is a Rust library designed to facilitate the integration between the Polars data analysis library and Plotly library.
Here lie the resources and topics necessary for the role of Data Scientist and Machine Learning
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
Streamlit — A faster way to build and share data apps.
10 Weeks, 20 Lessons, Data Science for All!
📝 An awesome Data Science repository to learn and apply for real world problems.
📺 Discover the latest machine learning / AI courses on YouTube.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Python everything Cheatsheet and a Journey to the land of Python programming
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Visualise your CSV files in seconds without sending your data anywhere
Bayesian Neural Field models for prediction in large-scale spatiotemporal datasets
The book every data scientist needs on their desk.
Welcome to the "100 Project Ideas for Full Stack Developers" repository. This project was created with the aim of providing a diverse and inspiring collection of project ideas for full-stack developer...
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.