47 results found Sort:

1.9k
9.8k
other
80
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Created 2020-07-03
1,194 commits to master branch, last one about a month ago
1.1k
8.1k
mit
94
An elegant PyTorch deep reinforcement learning library.
Created 2018-04-16
842 commits to master branch, last one 11 days ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 4 years ago
Minimal and Clean Reinforcement Learning Examples
Created 2017-01-13
264 commits to master branch, last one 7 years ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created 2018-09-27
98 commits to master branch, last one about a year ago
266
1.3k
mit
48
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Created 2017-10-02
2,569 commits to master branch, last one 2 years ago
189
1.1k
mit
27
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created 2017-10-17
100 commits to master branch, last one 3 years ago
162
766
mit
28
Deep Reinforcement Learning For Sequence to Sequence Models
Created 2018-05-24
181 commits to master branch, last one 5 years ago
Structural implementation of RL key algorithms
Created 2018-12-10
184 commits to master branch, last one 2 years ago
125
485
other
50
DEEp Reinforcement learning framework
Created 2016-01-21
489 commits to master branch, last one 7 months ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created 2020-10-02
23 commits to master branch, last one 3 years ago
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Created 2017-12-21
703 commits to master branch, last one 5 years ago
Reinforcement learning tutorials
Created 2020-01-13
84 commits to master branch, last one 2 years ago
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Created 2019-07-30
24 commits to master branch, last one 3 years ago
77
302
bsd-3-clause
15
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created 2018-09-26
32 commits to master branch, last one 2 months ago
43
284
mit
14
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Created 2020-06-03
799 commits to master branch, last one 7 months ago
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Created 2016-07-12
31 commits to master branch, last one 6 months ago
"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
Created 2020-06-24
89 commits to master branch, last one 3 years ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 6 months ago
13
154
unknown
2
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
Created 2023-10-17
3 commits to master branch, last one about a year ago
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems...
Created 2021-04-06
53 commits to main branch, last one 3 years ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created 2020-05-04
89 commits to master branch, last one 3 years ago
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Created 2020-06-13
38 commits to master branch, last one 4 years ago
Baseline implementation of recurrent PPO using truncated BPTT
Created 2021-06-07
13 commits to main branch, last one about a year ago
Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
Created 2023-01-07
16 commits to main branch, last one 8 months ago