8 results found
- Filter by Primary Language:
- Python (5)
- Jupyter Notebook (2)
- C++ (1)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22; 274 commits to master branch, last one 3 years ago
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08; 190 commits to main branch, last one 11 days ago
The Fastest Deep Reinforcement Learning Library
Created 2023-11-11; 2,553 commits to master branch, last one 23 days ago
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Created 2021-01-16; 133 commits to main branch, last one 2 years ago
PyTorch implementation of Soft Actor-Critic (SAC)
Created 2020-01-22; 27 commits to master branch, last one 3 years ago
End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG) in Gazebo
Created 2019-07-20; 17 commits to master branch, last one 2 years ago
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Created 2021-05-01; 1,019 commits to pub branch, last one about a year ago
PyTorch implementation of RDPG (Recurrent Deterministic Policy Gradient)
Created 2018-02-10; 17 commits to master branch, last one 5 years ago