Trending repositories for topic data-mining
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committe...
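An AI learning roadmap compiling nearly 200 hands-on cases and projects, with free accompanying course materials; starts from zero background and prepares you for real-world jobs. Covers: Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-in...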
:memo: An awesome Data Science repository to learn from and apply to real-world problems.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas...
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/c...
Anomaly detection related books, papers, videos, and toolboxes
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
An open source alternative to Tableau. Embeddable visual analytic
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
🔥🔥🔥 Latest Advances on Large Recommendation Models
A Telegram crawler made in Python to automatically search groups and channels and collect any type of data from them (+ dataset included).
A Python toolbox that loads 172 public time series datasets for machine/deep learning with a single line of code. Datasets from multiple domains including healthcare, financial, power, traffic, weather, an...
A Chrome extension for downloading photos and videos from Instagram posts, TV, reels, and stories
Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook
Node graphs, OSINT data mining, and plugins. Connect unstructured and public data for transformative insights
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, expl...
UnityPy is a Python module that makes it possible to extract/unpack and edit Unity assets
Yet Another Reddit Scrapper (without API keys) | Scrape search results, posts, and images from subreddits filtered by hot, new, etc., and bulk download any user's data.
gBolt: a very fast implementation of the gSpan algorithm for data mining
The only open-source toolkit that can download EDGAR financial reports and extract textual data from specific item sections into nice and clean JSON files.
Awesome Deep Learning for Time-Series Imputation, including a must-read paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
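Machine learning + big data + data security: a collection of resources on AI-based data-security risk monitoring, risk control, anti-fraud, API security, and web security, aiming to build a leading learning library for intelligent data security. Gathering these was not easy; stars are welcome. Machine learning + big data + data security: data security AI intelligent risk monitoring, web / api security,...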
An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Some awesome AI-related books and PDFs for learning and downloading, plus some playground models for learning
C# KQL query engine with flexible I/O layers and visualization
Compute the Pareto (non-dominated) set, i.e., skyline operator/query.
A parser for the 2GIS website that collects addresses and contact details of businesses in Russia and the CIS countries
PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development
Automated search and analysis of job vacancies from hh.ru (Headhunter) using Python. Data classification and computation of statistical parameters.
Tutorials for PyPOTS that guide you through modeling partially-observed time series datasets.
Scrape data from Goodreads using Scrapy and Selenium :books:
A project providing a Graphic Walker Pane for use with HoloViz Panel.
A curated list of valuable resources from our studies at the University of Tehran (UT), School of Electrical and Computer Engineering (ECE)
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
Sportsbookreview.com scraper + complete 10Y games+odds data for NFL, NBA, NHL, MLB for bettors and sports analysts
This repository contains a reading list of papers on Time Series Segmentation. This repository is still being continuously improved.
PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at ra...
Extensive acceptance rates and information for major AI conferences
Free Facebook pages MetaData Scraping Library - Unlimited Calls
Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling