6 results found

Minimal implementation of clipped-objective Proximal Policy Optimization (PPO) in PyTorch
Created 2018-09-27
98 commits to master branch, last one 11 months ago
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
Created 2022-03-23
163 commits to main branch, last one about a year ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created 2020-05-04
89 commits to master branch, last one 2 years ago
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
Created 2024-11-01
8 commits to master branch, last one 5 days ago
Implementation of PPO Lagrangian in PyTorch
Created 2021-08-06
17 commits to main branch, last one 2 years ago