Trending repositories for topic data-science
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Apache Superset is a Data Visualization and Data Exploration Platform
500 AI, machine learning, deep learning, computer vision, and NLP projects with code
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An awesome Data Science repository to learn from and apply to real-world problems.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
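AI learning roadmap: nearly 200 hands-on cases and projects, with free accompanying teaching materials, from a zero-background introduction to job-ready practice! Covers: Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...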
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Fahmatrix is a lightweight, modern Java library for working with tabular data, inspired by Python's Pandas and rooted in the idea of making data understanding (fahm) easy on the JVM.
Track Your Job Applications Automatically — Straight From Your Inbox. Built by jobseekers, for jobseekers. Free forever.
List of resources for Astronomy Data Science
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, a...
My graduate level machine learning course, including student machine learning projects.
The most comprehensive SQL guide from a real-world expert! Learn everything from basics to advanced queries, optimizations, and real-world SQL
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
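"Linear Algebra Isn't Hard", volumes 1 and 2, from the "Math Isn't Hard" series; complete in 66 topics. Criticism and corrections are welcome.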
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.
Buckaroo - The data table UI for Notebooks. Quickly explore dataframes, scroll through dataframes, search, sort, view summary stats and histograms. Works with Pandas, Polars, Jupyter, Marimo, VSCode...
An orchestration platform for the development, production, and observation of data assets.
Excelize is a Python port of the Go Excelize library that allows you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
DeepShot is a machine learning model designed to predict NBA game outcomes using advanced team statistics and rolling averages. It combines historical performance trends with contextual game data to d...
🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for extracting structured tabular data from images. Ideal for LLM preprocessing, data analysis, and automati...
Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Here is an unofficial cheat sheet for the AI-900 Microsoft Certified: Azure AI Fundamentals certification. I hope it helps you pass your certification.
Data Science projects on various problem statements and datasets using Data Analysis, Machine Learning Algorithms, Deep Learning Algorithms, Natural Language Processing, Business Intelligence concepts...
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
Code Repository for Machine Learning, Data Science and Generative AI with Python, Published by Packt
A curated list of valuable resources from our studies at the University of Tehran (UT), School of Electrical and Computer Engineering (ECE)
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL
DataMap is a browser-based app for visualizing data using heatmaps, PCA plots, and t-SNE plots.
The CleanEnergyBot is a Telegram bot providing real-time electricity usage, CO2 forecasts, and energy-saving tips in Ireland, using data from EirGrid and GPT-3 analysis. It helps users make eco-friend...
This repository contains end-to-end sample projects designed to run with minimal effort across a variety of use cases, including data science, machine learning, deep learning, and generative AI method...
PDF DataSource for Apache Spark that allows reading PDF files directly into a DataFrame and running OCR on them
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
🟣 Recommendation Systems interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digita...
Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.
A curated list of 100+ resources for building and deploying generative AI, focused on helping you become a Generative AI Data Scientist with LLMs
Plotlars is a Rust library designed to facilitate the integration between the Polars data analysis library and Plotly library.
Cross Beat (xbe.at) - Your hub for Python, machine learning, and AI tutorials. Explore Python tutorials, AI insights, and more.
A roadmap to guide you through mastering SQL for Data Science in just 6 weeks for free
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
Streamlit — A faster way to build and share data apps.
Python training for business analysts and traders
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
2025 AI/ML internship & new graduate job list updated daily
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
Visual Data Preparation and Transformation. Low-Code Python-based ETL.
Visualise your CSV files in seconds without sending your data anywhere
This repository contains Data & AI concepts covered on my Threads page.
The book every data scientist needs on their desk.
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
🟣 Computer Vision interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.