Trending repositories for topic data-science
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
500 AI Machine learning Deep learning Computer vision NLP Projects with code
:memo: An awesome Data Science repository to learn and apply for real world problems.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Book_7_《机器学习》 | 鸢尾花书:从加减乘除到机器学习;欢迎批评指正
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
Automated prompt-based testing and evaluation of Gen AI applications
A comprehensive solution for monitoring your AI models in production
Analysis of F1 races and their drivers
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
Resources about solar power systems for data science
The best collection of AI tutorials to make you a boss of Data Science!
Hello everyone this repo will contain my journey of machine learning and DeepLearning with some exciting projects
Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in, to ...
ML/AI meta-model, used in MLRun/Iguazio/Nuclio, see qgate-sln-<solution>
Open source AI platform for rapid development of advanced AI and AGI pipelines.
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility acr...
An AI-powered Python notebook built in React — generate and edit code cells, automatically fix errors, and chat with your code
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
Scroll is a language for scientists of all ages. Scroll includes a command line app that builds static blogs, websites, CSVs, text files, and more.
The open-source tool for building high-quality datasets and computer vision models
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!
:memo: An awesome Data Science repository to learn and apply for real world problems.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
A comprehensive solution for monitoring your AI models in production
Automated prompt-based testing and evaluation of Gen AI applications
Analysis of F1 races and their drivers
Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
ML-algorithms from scratch using Python. Classic Machine Learning course.
Promotes development of ML algorithms for early detection and classification of undesirable events in offshore oil wells.
Air Pollution Image Dataset from India and Nepal
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in, to ...
The open-source tool for building high-quality datasets and computer vision models
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Accompanying repository for my book about Graph Data Science
This repository contains resources in the form of ebooks, which are related to Data Science, Machine Learning, and similar topics.
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility acr...
Automated prompt-based testing and evaluation of Gen AI applications
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
A comprehensive solution for monitoring your AI models in production
The open-source tool for building high-quality datasets and computer vision models
An AI-powered Python notebook built in React — generate and edit code cells, automatically fix errors, and chat with your code
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
A general-purpose library designed to guide developers in expressing their code as a flow.
Chat with your data using natural language. CSV, Postgres, MySQL, Snowflake, SQLite...
A comprehensive solution for monitoring your AI models in production
Automated prompt-based testing and evaluation of Gen AI applications
Air Pollution Image Dataset from India and Nepal
Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide
A roadmap for getting started with Machine Learning
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Collection of Snowflake Notebook demos, tutorials, and examples
Resources about solar power systems for data science
ML/AI meta-model, used in MLRun/Iguazio/Nuclio, see qgate-sln-<solution>
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
An AI-powered Python notebook built in React — generate and edit code cells, automatically fix errors, and chat with your code
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.
PlotAI - Your Ultimate Plotting Assistant! 📊🤖 Use ChatGPT-3.5 to create plots in Python and Matplotlib directly in your Python script or notebook.
AI-powered key driver analysis tool that pinpoints root cause behind metrics fluctuation in one minute.
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
A Tutorial for Setting R Development Environment with VScode, Dev Containers, and Docker
Une liste de ressources sur tout ce qui touche à la prise de décision : vidéos, tutoriels, livres, documents, thèses, articles, datasets et libs open source.
Greetings! 👋 I'm Loga Aswin, diving into a 100-day data science immersion from Python fundamentals to real-world applications. This space will be a live documentation of my journey, where code meets ...
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in, to ...
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Apache Superset is a Data Visualization and Data Exploration Platform
Streamlit — A faster way to build and share data apps.
10 Weeks, 20 Lessons, Data Science for All!
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
The open-source tool for building high-quality datasets and computer vision models
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
📺 Discover the latest machine learning / AI courses on YouTube.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
A Tutorial for Setting R Development Environment with VScode, Dev Containers, and Docker
Une liste de ressources sur tout ce qui touche à la prise de décision : vidéos, tutoriels, livres, documents, thèses, articles, datasets et libs open source.
Hello Data Enthusiast! I will be updating my 100-day Journey here along with detailed Code Files Starting from Essential Libraries to Advanced Machine Learning and Deep Learning Algorithm Theory with ...
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
Starting a 100 Days Code Challenge for Learning Data Science from Scratch
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualization tool (Streamlit)
A curated list of papers that released datasets along with their work
Carefully curated list of awesome data science resources.
Visual Pandas Selector: Visualize and interactively select time-series data
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
Here lies the resources and topics necessary for the role of Data Scientist and Machine Learning
A Python script that anonymizes an Excel file and synthesizes new data in its place.
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
Collection of free Notes,Courses,Videos,Projects,Articles and Repos Links To learn Machine learning ,Deep learning,Python,SQL,CNN,NLP,GAN,GNN,Transfomers,Flask,Django,and End to End Machine learning P...
In this repo, there are (beginner-upper) level projects in the field of data science. I will host these projects that I have done in this field every day in this repo. With the hope that it will be us...
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.