Trending repositories for topic data-mining
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
:memo: An awesome Data Science repository to learn and apply for real world problems.
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
Anomaly detection related books, papers, videos, and toolboxes
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Official public repository for PM4Py (Process Mining for Python) — an open-source library for exploring, analyzing, and optimizing business processes with Python.
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
A curated list of graph-based fraud, anomaly, and outlier detection papers & resources
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
🔥🔥🔥 Latest Advances on Large Recommendation Models
RAVEN is a flexible and multi-purpose probabilistic risk analysis, validation and uncertainty quantification, parameter optimization, model reduction and data knowledge-discovering framework.
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Coursera Specialization: Machine Learning and Data Analysis (Yandex & MIPT)
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Official public repository for PM4Py (Process Mining for Python) — an open-source library for exploring, analyzing, and optimizing business processes with Python.
Deep and conventional community detection related papers, implementations, datasets, and tools.
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
A curated list of graph-based fraud, anomaly, and outlier detection papers & resources
:video_game: A curated list of awesome game datasets, and tools to artificial intelligence in games
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
:memo: An awesome Data Science repository to learn and apply for real world problems.
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
UnityPy is python module that makes it possible to extract/unpack and edit Unity assets
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Anomaly detection related books, papers, videos, and toolboxes
An open source alternative to Tableau. Embeddable visual analytic
🔥🔥🔥 Latest Advances on Large Recommendation Models
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
C# KQL query engine with flexible I/O layers and visualization
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
A professional list of Tutorials and Surveys on DL, ML, DM, CV, NLP, Speech in top AI conferences and journals.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
RAVEN is a flexible and multi-purpose probabilistic risk analysis, validation and uncertainty quantification, parameter optimization, model reduction and data knowledge-discovering framework.
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
UnityPy is python module that makes it possible to extract/unpack and edit Unity assets
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files.
This toolbox offers 13 wrapper feature selection methods (PSO, GA, GWO, HHO, BA, WOA, and etc.) with examples. It is simple and easy to implement.
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
:memo: An awesome Data Science repository to learn and apply for real world problems.
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
Anomaly detection related books, papers, videos, and toolboxes
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
An open source alternative to Tableau. Embeddable visual analytic
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Website-downloader is a powerful and versatile Python script designed to download entire websites along with all their assets. This tool allows you to create a local copy of a website, including HTML ...
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
Astrostatistics and Machine Learning class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
C# KQL query engine with flexible I/O layers and visualization
The tutorials for PyPOTS, guide you to model partially-observed time series datasets.
Free Facebook pages MetaData Scraping Library - Unlimited Calls
🔥🔥🔥 Latest Advances on Large Recommendation Models
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
This repository contains the time series segmentation benchmark (TSSB).
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
Chrome Extension, download photos, videos from Instagram post, tv, reels, stories
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files.
PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at ra...
A project providing a Graphic Walker Pane for use with HoloViz Panel.
🔥🔥🔥 Latest Advances on Large Recommendation Models
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
A curated list of valuable resources from our studies at the University of Tehran (UT), School of Electrical and Computer Engineering (ECE)
Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
Website-downloader is a powerful and versatile Python script designed to download entire websites along with all their assets. This tool allows you to create a local copy of a website, including HTML ...
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
:memo: An awesome Data Science repository to learn and apply for real world problems.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
Anomaly detection related books, papers, videos, and toolboxes
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
An open source alternative to Tableau. Embeddable visual analytic
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Sportsbookreview.com scraper + complete 10Y games+odds data for NFL, NBA, NHL, MLB for bettors and sports analysts
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at ra...
Free Facebook pages MetaData Scraping Library - Unlimited Calls
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Astrostatistics and Machine Learning class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)
Chrome Extension, download photos, videos from Instagram post, tv, reels, stories
The tutorials for PyPOTS, guide you to model partially-observed time series datasets.
Extensive acceptance rates and information of main AI conferences
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.