2 results found Sort:

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 3 years ago
462
2.3k
apache-2.0
126
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
This repository has been archived (exclude archived)
Created 2017-10-01
524 commits to master branch, last one about a year ago