44 results found Sort:

672
5.9k
other
38
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Created 2019-06-07
832 commits to master branch, last one about a month ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 4 years ago
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Created 2018-03-25
166 commits to master branch, last one about a year ago
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 3 years ago
Minimal and Clean Reinforcement Learning Examples
Created 2017-01-13
264 commits to master branch, last one 7 years ago
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Created 2017-02-13
46 commits to master branch, last one 5 years ago
225
1.2k
mit
70
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Created 2017-01-30
3,471 commits to master branch, last one 3 years ago
Deep Reinforcement Learning with pytorch & visdom
Created 2017-04-10
85 commits to master branch, last one 6 years ago
162
766
mit
28
Deep Reinforcement Learning For Sequence to Sequence Models
Created 2018-05-24
181 commits to master branch, last one 5 years ago
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
Created 2018-01-13
25 commits to master branch, last one 3 years ago
Simple A3C implementation with pytorch + multiprocessing
Created 2018-01-18
17 commits to master branch, last one 4 years ago
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Created 2016-12-09
89 commits to master branch, last one about a year ago
119
562
apache-2.0
20
A3C LSTM Atari with Pytorch plus A3G design
Created 2017-05-25
172 commits to master branch, last one about a year ago
PyTorch implementation of Soft Actor-Critic (SAC)
Created 2020-01-22
27 commits to master branch, last one 3 years ago
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Created 2020-04-02
130 commits to master branch, last one about a year ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created 2020-10-02
23 commits to master branch, last one 3 years ago
Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch
Created 2018-05-09
244 commits to master branch, last one about a year ago
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Created 2019-06-03
38 commits to master branch, last one 3 years ago
29
204
gpl-3.0
21
Reinforcement learning framework to accelerate research
Created 2018-08-23
275 commits to master branch, last one 3 years ago
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Created 2019-04-07
25 commits to master branch, last one 2 years ago
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and effi...
Created 2017-09-27
14 commits to master branch, last one 3 months ago
43
171
apache-2.0
5
A high-performance Atari A3C agent in 180 lines of PyTorch
Created 2017-10-03
42 commits to master branch, last one 3 years ago
"Neural Combinatorial Optimization with Reinforcement Learning"[Bello+, 2016], Traveling Salesman Problem solver
Created 2020-05-11
80 commits to master branch, last one 3 years ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 6 months ago
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines from OpenAI
Created 2018-06-11
54 commits to master branch, last one 3 years ago
Baseline implementation of recurrent PPO using truncated BPTT
Created 2021-06-07
13 commits to main branch, last one about a year ago
Code for our paper "Visualizing and Understanding Atari Agents" (https://goo.gl/AMAoSc)
Created 2017-10-31
14 commits to master branch, last one 3 years ago