8 results found Sort:

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 3 years ago
125
1.2k
apache-2.0
12
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
190 commits to main branch, last one 11 days ago
The Fastest Deep Reinforcement Learning Library
Created 2023-11-11
2,553 commits to master branch, last one 23 days ago
70
640
mit
13
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Created 2021-01-16
133 commits to main branch, last one 2 years ago
PyTorch implementation of Soft Actor-Critic (SAC)
Created 2020-01-22
27 commits to master branch, last one 3 years ago
End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo
Created 2019-07-20
17 commits to master branch, last one 2 years ago
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Created 2021-05-01
1,019 commits to pub branch, last one about a year ago
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
Created 2018-02-10
17 commits to master branch, last one 5 years ago