Statistics for topic data-science
RepositoryStats tracks 651,542 Github repositories, of these 2,263 are tagged with the data-science topic. The most common primary language for repositories using this topic is Python (777). Other languages include: Jupyter Notebook (650), R (78), HTML (71), TypeScript (55), JavaScript (54), Go (41), C++ (38), Rust (24), Java (23)
Stargazers over time for topic data-science
Most starred repositories for topic data-science (view more)
Trending repositories for topic data-science (view more)
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Aquí tienes un acordeón no oficial de la certificación AI-900 Microsoft Certified: Azure AI Fundamentals. Espero te sirva para aprobar tu certificación
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
This repository contains end-to-end sample projects designed to run with minimal effort across a variety of use cases, including data science, machine learning, deep learning, and generative AI method...
500 AI Machine learning Deep learning Computer vision NLP Projects with code
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
Python training for business analysts and traders
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Aquí tienes un acordeón no oficial de la certificación AI-900 Microsoft Certified: Azure AI Fundamentals. Espero te sirva para aprobar tu certificación
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
This repository contains end-to-end sample projects designed to run with minimal effort across a variety of use cases, including data science, machine learning, deep learning, and generative AI method...
About The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
This repository contains end-to-end sample projects designed to run with minimal effort across a variety of use cases, including data science, machine learning, deep learning, and generative AI method...
Aquí tienes un acordeón no oficial de la certificación AI-900 Microsoft Certified: Azure AI Fundamentals. Espero te sirva para aprobar tu certificación
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL
About The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
The CleanEnergyBot is a Telegram bot providing real-time electricity usage, CO2 forecasts, and energy-saving tips in Ireland, using data from EirGrid and GPT-3 analysis. It helps users make eco-friend...
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digita...
Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Streamlit — A faster way to build and share data apps.
Apache Superset is a Data Visualization and Data Exploration Platform
2025 AI/ML internship & new graduate job list updated daily
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
Visual Data Preparation and Transformation. Low-Code Python-based ETL.
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.