Trending repositories for topic data-mining
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
:memo: An awesome Data Science repository to learn and apply for real world problems.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
An open source alternative to Tableau. Embeddable visual analytic
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
Anomaly detection related books, papers, videos, and toolboxes
The "Python Machine Learning (1st edition)" book code repository and info resource
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files.
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
GrimoireLab: platform for software development analytics and insights
An open source alternative to Tableau. Embeddable visual analytic
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning alg...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
:memo: An awesome Data Science repository to learn and apply for real world problems.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
Anomaly detection related books, papers, videos, and toolboxes
An open source alternative to Tableau. Embeddable visual analytic
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
a collection of awesome machine learning and deep learning Python libraries&tools. 热门实用机器学习和深入学习Python库和工具的集合
An official source code for paper "Graph Anomaly Detection via Multi-Scale Contrastive Learning Networks with Augmented View", accepted by AAAI 2023.
The tutorials for PyPOTS, guide you to model partially-observed time series datasets.
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development
Chrome Extension, download photos, videos from Instagram post, tv, reels, stories
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Node graphs, OSINT data mining, and plugins. Connect unstructured and public data for transformative insights
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
A Telegram crawler made in Python to automatically search groups and channels and collect any type of data from them (+ dataset included).
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
:memo: An awesome Data Science repository to learn and apply for real world problems.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
Anomaly detection related books, papers, videos, and toolboxes
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
An open source alternative to Tableau. Embeddable visual analytic
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
Website-downloader is a powerful and versatile Python script designed to download entire websites along with all their assets. This tool allows you to create a local copy of a website, including HTML ...
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Astrostatistics and Machine Learning class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)
The tutorials for PyPOTS, guide you to model partially-observed time series datasets.
C# KQL query engine with flexible I/O layers and visualization
Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
Astrostatistics and Machine Learning class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)
Chrome Extension, download photos, videos from Instagram post, tv, reels, stories
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
An official source code for paper "Graph Anomaly Detection via Multi-Scale Contrastive Learning Networks with Augmented View", accepted by AAAI 2023.
Compute the Pareto (non-dominated) set, i.e., skyline operator/query.
Road Network Enhanced Trajectory Recovery with Spatial-Temporal Transformer (ICDE'23)
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
A project providing a Graphic Walker Pane for use with HoloViz Panel.
🔥🔥🔥 Latest Advances on Large Recommendation Models
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
A curated list of valuable resources from our studies at the University of Tehran (UT), School of Electrical and Computer Engineering (ECE)
Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
Website-downloader is a powerful and versatile Python script designed to download entire websites along with all their assets. This tool allows you to create a local copy of a website, including HTML ...
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
:memo: An awesome Data Science repository to learn and apply for real world problems.
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Anomaly detection related books, papers, videos, and toolboxes
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
An open source alternative to Tableau. Embeddable visual analytic
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
Sportsbookreview.com scraper + complete 10Y games+odds data for NFL, NBA, NHL, MLB for bettors and sports analysts
PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at ra...
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
Astrostatistics and Machine Learning class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)
Free Facebook pages MetaData Scraping Library - Unlimited Calls
The tutorials for PyPOTS, guide you to model partially-observed time series datasets.
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Extensive acceptance rates and information of main AI conferences
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.
Chrome Extension, download photos, videos from Instagram post, tv, reels, stories