49 results found
Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/
Created
2020-07-03
1,204 commits to master branch, last one 17 days ago
Simple reinforcement learning tutorials, 莫烦Python (Chinese-language AI teaching)
a3c
dqn
ppo
ddpg
sarsa
tutorial
double-dqn
q-learning
dueling-dqn
actor-critic
sarsa-lambda
deep-q-network
policy-gradient
machine-learning
prioritized-replay
tensorflow-tutorials
reinforcement-learning
proximal-policy-optimization
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2017-05-06
114 commits to master branch, last one 2 years ago
An elegant PyTorch deep reinforcement learning library.
Created
2018-04-16
847 commits to master branch, last one about a month ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created
2018-06-09
4 commits to master branch, last one 5 years ago
Minimal and Clean Reinforcement Learning Examples
Created
2017-01-13
264 commits to master branch, last one 7 years ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created
2018-09-27
98 commits to master branch, last one about a year ago
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Created
2017-10-02
2,573 commits to master branch, last one about a month ago
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created
2017-10-17
100 commits to master branch, last one 4 years ago
Scalable, event-driven, deep-learning-friendly backtesting library
a3c
unreal
backtrader
openai-gym
tensorflow
time-series
hacktoberfest
gym-environment
policy-gradient
algoritmic-trading
quantitive-finance
policy-optimisation
statistical-arbitrage
advantage-actor-critic
reinforcement-learning
algorithmic-trading-library
deep-reinforcement-learning
backtesting-trading-strategies
Created
2017-05-24
635 commits to master branch, last one 3 years ago
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
ppo
drqn
trpo
sarsa
double-dqn
openai-gym
q-learning
dueling-dqn
monte-carlo
deep-q-network
policy-gradient
policy-gradients
reinforcement-learning
deep-learning-algorithms
deep-recurrent-q-network
markov-decision-processes
deep-reinforcement-learning
hindsight-experience-replay
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2018-06-11
44 commits to master branch, last one 4 years ago
Deep Reinforcement Learning For Sequence to Sequence Models
Created
2018-05-24
181 commits to master branch, last one 5 years ago
A curated list of Monte Carlo tree search papers with implementations.
Created
2019-11-22
112 commits to master branch, last one about a year ago
Structural implementation of RL key algorithms
Created
2018-12-10
184 commits to master branch, last one 2 years ago
DEEp Reinforcement learning framework
Created
2016-01-21
489 commits to master branch, last one 11 months ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow, with extensive math
Created
2020-10-02
23 commits to master branch, last one 4 years ago
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Created
2017-12-21
703 commits to master branch, last one 5 years ago
Reinforcement learning tutorials
Created
2020-01-13
84 commits to master branch, last one 2 years ago
Easy-to-read DeepRL algorithm implementations in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Created
2019-07-30
24 commits to master branch, last one 3 years ago
📚 List of top-tier conference papers on reinforcement learning (RL), including NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
dqn
aaai
icml
ijcai
neurips
rl-papers
q-learning
policy-gradient
imitation-learning
reinforcement-learning
artificial-intelligence
deep-reinforcement-learning
meta-reinforcement-learning
reinforcement-learning-paper
reinforcement-learning-papers
offline-reinforcement-learning
multi-agent-reinforcement-learning
reinforcement-learning-conferences
hierarchical-reinforcement-learning
reinforcement-learning-conferences-papers
Created
2023-01-05
117 commits to main branch, last one 10 months ago
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created
2018-09-26
33 commits to master branch, last one about a month ago
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Created
2020-06-03
813 commits to master branch, last one about a month ago
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Created
2016-07-12
31 commits to master branch, last one 10 months ago
"Attention, Learn to Solve Routing Problems!" [Kool+, 2019], Capacitated Vehicle Routing Problem solver
Created
2020-06-24
89 commits to master branch, last one 4 years ago
Code for the paper "ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models"
Created
2023-10-17
3 commits to master branch, last one about a year ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created
2022-05-04
9 commits to main branch, last one 10 months ago
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems...
Created
2021-04-06
53 commits to main branch, last one 3 years ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created
2020-05-04
89 commits to master branch, last one 3 years ago
Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial.
Created
2023-01-07
16 commits to main branch, last one about a year ago
Baseline implementation of recurrent PPO using truncated BPTT
Created
2021-06-07
13 commits to main branch, last one about a year ago
Solutions to the assignments of the Deep Reinforcement Learning course (CS285) at the University of California, Berkeley, implemented in PyTorch
Created
2020-06-13
38 commits to master branch, last one 4 years ago