Statistics for topic reinforcement-learning
RepositoryStats tracks 579,129 Github repositories, of these 1,293 are tagged with the reinforcement-learning topic. The most common primary language for repositories using this topic is Python (875). Other languages include: Jupyter Notebook (173), C++ (47), C# (15), HTML (11)
Stargazers over time for topic reinforcement-learning
Most starred repositories for topic reinforcement-learning (view more)
Trending repositories for topic reinforcement-learning (view more)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
Using DDPG and ConvLSTM to control a drone to avoid obstacle in AirSim
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Multi-Agent Deep Reinforcement Learning (MA-DRL) Routing Simulator for satellite networks
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Using DDPG and ConvLSTM to control a drone to avoid obstacle in AirSim
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
Q学习可视化工具使用强化学习来教智能体学习行动策略。用户可通过自定义地图和不同策略进行实验,并观察未训练和训练后的智能体在地图中的表现。启动模拟命令:python rlearn.py <地图名称>。
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Top paper collection for stock price prediction, quantitative trading. Covering top conferences and journals like KDD, WWW, CIKM, AAAI, IJCAI, ACL, EMNLP.
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator