Trending repositories for topic reinforcement-learning
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...
Using DDPG and ConvLSTM to control a drone to avoid obstacles in AirSim
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Multi-Agent Deep Reinforcement Learning (MA-DRL) Routing Simulator for satellite networks
Robot arm control using reinforcement learning algorithms: DDPG and TD3 with hindsight experience replay (HER)
Uses the multi-agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
[NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state seq...
Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps
[NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Chargym simulates the operation of an electric vehicle charging station (EVCS) considering random EV arrivals and departures within a day. This is a generalised environment for charging/discharging E...
A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
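A Q-learning visualization tool that uses reinforcement learning to teach an agent an action policy. Users can experiment with custom maps and different strategies, and observe how untrained and trained agents perform on the map. Command to start a simulation: python rlearn.py <map name>.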
Official Implementation for the paper "R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models"
HFTFramework utilized for research on "A reinforcement learning approach to improve the performance of the Avellaneda-Stoikov market-making algorithm"
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
[IROS 2024] EPH: Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
Play, learn, solve, and analyze No-Limit Texas Hold'em. Implementation follows from Monte Carlo counterfactual regret minimization with hierarchical K-means imperfect-recall abstractions.
Doosan robotic arm, simulation, control, visualization in Gazebo and ROS2 for Reinforcement Learning.
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for be...
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
An extensive library of AI resources including books, courses, papers, guides, articles, tutorials, notebooks, AI field advancements and more.
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
Awesome LLM papers and repos covering a comprehensive range of topics.
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/
A curated list of reinforcement learning with human feedback resources (continually updated)
Top paper collection for stock price prediction, quantitative trading. Covering top conferences and journals like KDD, WWW, CIKM, AAAI, IJCAI, ACL, EMNLP.
Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"
Simplifying reinforcement learning for complex game environments
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
[NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
[RSS 2024] Expressive Whole-Body Control for Humanoid Robots
A collection of recent papers on building autonomous agents, covering two topics: RL-based and LLM-based agents.
🚗 This repository offers a ready-to-use training and evaluation environment for conducting various experiments using Deep Reinforcement Learning (DRL) in the CARLA simulator with the help of Stable B...
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
Implementation of Soft Actor-Critic and some of its improvements in PyTorch