4 results found Sort:

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created 2018-09-27
98 commits to master branch, last one 6 months ago
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Created 2022-03-23
163 commits to main branch, last one about a year ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created 2020-05-04
89 commits to master branch, last one 2 years ago