42 results found

557
4.8k
other
35
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Created 2019-06-07
828 commits to master branch, last one 16 days ago
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Created 2018-03-25
166 commits to master branch, last one about a year ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 4 years ago
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 2 years ago
Minimal and Clean Reinforcement Learning Examples
Created 2017-01-13
264 commits to master branch, last one 6 years ago
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Created 2017-02-13
46 commits to master branch, last one 5 years ago
226
1.2k
mit
71
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Created 2017-01-30
3,471 commits to master branch, last one 3 years ago
Deep Reinforcement Learning with PyTorch & Visdom
Created 2017-04-10
85 commits to master branch, last one 6 years ago
160
762
mit
28
Deep Reinforcement Learning For Sequence to Sequence Models
Created 2018-05-24
181 commits to master branch, last one 5 years ago
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
Created 2018-01-13
25 commits to master branch, last one 3 years ago
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Created 2016-12-09
89 commits to master branch, last one about a year ago
Simple A3C implementation with PyTorch + multiprocessing
Created 2018-01-18
17 commits to master branch, last one 4 years ago
118
558
apache-2.0
20
A3C LSTM Atari with Pytorch plus A3G design
Created 2017-05-25
172 commits to master branch, last one about a year ago
PyTorch implementation of Soft Actor-Critic (SAC)
Created 2020-01-22
27 commits to master branch, last one 3 years ago
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Created 2020-04-02
130 commits to master branch, last one 10 months ago
Code for the book Hands-On Intelligent Agents with OpenAI Gym, to get started and learn to build deep reinforcement learning agents using PyTorch
Created 2018-05-09
244 commits to master branch, last one about a year ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created 2020-10-02
23 commits to master branch, last one 3 years ago
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Created 2019-06-03
38 commits to master branch, last one 2 years ago
30
202
gpl-3.0
21
Reinforcement learning framework to accelerate research
Created 2018-08-23
275 commits to master branch, last one 3 years ago
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Created 2019-04-07
25 commits to master branch, last one about a year ago
43
169
apache-2.0
5
A high-performance Atari A3C agent in 180 lines of PyTorch
Created 2017-10-03
42 commits to master branch, last one 2 years ago
"Neural Combinatorial Optimization with Reinforcement Learning"[Bello+, 2016], Traveling Salesman Problem solver
Created 2020-05-11
80 commits to master branch, last one 3 years ago
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" (Nair et al.) on top of the HER baselines from OpenAI
Created 2018-06-11
54 commits to master branch, last one 2 years ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 8 days ago
Code for our paper "Visualizing and Understanding Atari Agents" (https://goo.gl/AMAoSc)
Created 2017-10-31
14 commits to master branch, last one 2 years ago
13
116
apache-2.0
4
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Created 2021-03-21
24 commits to main branch, last one 2 years ago
Baseline implementation of recurrent PPO using truncated BPTT
Created 2021-06-07
13 commits to main branch, last one 9 months ago