Trending repositories for topic data-science
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
500 AI, machine learning, deep learning, computer vision, and NLP projects with code
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Apache Superset is a Data Visualization and Data Exploration Platform
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Python training for business analysts and traders
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
100+ AI, machine learning, deep learning, computer vision, and NLP projects with code
Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intel...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, a...
Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗
🏎️ A machine-learning approach to predict Formula 1 Grand Prix race outcomes.
Cross Beat (xbe.at) - Your hub for Python, machine learning, and AI tutorials. Explore Python tutorials, AI insights, and more.
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
Package for processing and analyzing glycans and their role in biology.
Open source AI platform for rapid development of advanced AI and AGI pipelines.
A curated list of 100+ resources for building and deploying generative AI, focused on helping you become a Generative AI Data Scientist with LLMs
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in, to ...
Collection of Data Science pet projects
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
If you're coming from one of my data science tutorials, you'll find the code and the links to the tutorials here. I hope you find them helpful. Happy learning and coding!
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.
The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.
Here, I share .pbix files of the visualizations featured in my blog posts. Explore and download these files to dive deeper into the data behind the stories.
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 15+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
A high-level network analysis and graph mining library for Rust 🦀
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
RushDB is an instant database for modern apps and DS/ML ops built on top of Neo4j
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Comprehensive repository of Data Science projects spanning Machine Learning, Deep Learning, and Natural Language Processing. Demonstrates practical applications of algorithms and tools on real-world d...
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics / ...
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digita...
Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
AIDE: AI-Driven Exploration in the Space of Code. A state-of-the-art machine learning engineering agent that automates AI R&D.
Plotlars is a Rust library designed to facilitate the integration between the Polars data analysis library and Plotly library.
Here lie the resources and topics necessary for the role of Data Scientist and Machine Learning
Streamlit — A faster way to build and share data apps.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
10 Weeks, 20 Lessons, Data Science for All!
📝 An awesome Data Science repository to learn and apply for real-world problems.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
2025 AI/ML internship & new graduate job list updated daily
Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
Visualise your CSV files in seconds without sending your data anywhere
Coeus 🌐 is an OSINT ToolBox empowering users with tools for effective intelligence gathering from open sources. From social media monitoring 📱 to data analysis 📊, it offers a centralized platform f...
This repository contains Data & AI concepts covered on my Threads page.
The book every data scientist needs on their desk.