49 results found Sort:

2.0k
10.9k
other
78
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Created 2020-07-03
1,204 commits to master branch, last one 17 days ago
1.1k
8.4k
mit
91
An elegant PyTorch deep reinforcement learning library.
Created 2018-04-16
847 commits to master branch, last one about a month ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 5 years ago
Minimal and Clean Reinforcement Learning Examples
Created 2017-01-13
264 commits to master branch, last one 7 years ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created 2018-09-27
98 commits to master branch, last one about a year ago
274
1.3k
mit
46
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Created 2017-10-02
2,573 commits to master branch, last one about a month ago
190
1.2k
mit
26
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created 2017-10-17
100 commits to master branch, last one 4 years ago
163
767
mit
27
Deep Reinforcement Learning For Sequence to Sequence Models
Created 2018-05-24
181 commits to master branch, last one 5 years ago
Structural implementation of RL key algorithms
Created 2018-12-10
184 commits to master branch, last one 2 years ago
123
484
other
50
DEEp Reinforcement learning framework
Created 2016-01-21
489 commits to master branch, last one 11 months ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created 2020-10-02
23 commits to master branch, last one 4 years ago
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Created 2017-12-21
703 commits to master branch, last one 5 years ago
Reinforcement learning tutorials
Created 2020-01-13
84 commits to master branch, last one 2 years ago
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Created 2019-07-30
24 commits to master branch, last one 3 years ago
80
309
bsd-3-clause
14
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created 2018-09-26
33 commits to master branch, last one about a month ago
43
288
other
12
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Created 2020-06-03
813 commits to master branch, last one about a month ago
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Created 2016-07-12
31 commits to master branch, last one 10 months ago
"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
Created 2020-06-24
89 commits to master branch, last one 4 years ago
13
181
unknown
2
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
Created 2023-10-17
3 commits to master branch, last one about a year ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 10 months ago
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems...
Created 2021-04-06
53 commits to main branch, last one 3 years ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created 2020-05-04
89 commits to master branch, last one 3 years ago
Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
Created 2023-01-07
16 commits to main branch, last one about a year ago
Baseline implementation of recurrent PPO using truncated BPTT
Created 2021-06-07
13 commits to main branch, last one about a year ago
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Created 2020-06-13
38 commits to master branch, last one 4 years ago