2 results found Sort:
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created
2017-08-22
274 commits to master branch, last one 3 years ago
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
This repository has been archived
(exclude archived)
Created
2017-10-01
524 commits to master branch, last one about a year ago