4 results found Sort:

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Created 2019-10-02
9 commits to master branch, last one 3 years ago
Proximal Policy Optimization (PPO) algorithm for Contra
Created 2019-09-06
3 commits to master branch, last one 3 years ago
This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.
Created 2021-03-31
23 commits to master branch, last one 2 years ago
PPO, DDPG, SAC implementation on mujoco environment
Created 2021-03-01
55 commits to main branch, last one 2 years ago