Statistics for topic data-science
RepositoryStats tracks 616,864 Github repositories, of these 2,192 are tagged with the data-science topic. The most common primary language for repositories using this topic is Python (763). Other languages include: Jupyter Notebook (624), R (75), HTML (71), TypeScript (53), JavaScript (52), Go (39), C++ (36), Java (23), Rust (21)
Stargazers over time for topic data-science
Most starred repositories for topic data-science (view more)
Trending repositories for topic data-science (view more)
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, a...
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics / ...
Morph is a python-centric full-stack framework for building and deploying data apps.
Production-grade ML - F# power & precision guiding Torch performance
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
🐍 Data Analysis with the Pandas Library & Notes 📊📈
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, a...
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Production-grade ML - F# power & precision guiding Torch performance
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Streamlit — A faster way to build and share data apps.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
2025 AI/ML internship & new graduate job list updated daily
Python everything Cheatsheet and a Journey to the land of Python programming
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.