47 results found Sort:
- Filter by Primary Language:
- Python (31)
- Jupyter Notebook (13)
- +
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Created
2020-07-03
1,194 commits to master branch, last one about a month ago
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
a3c
dqn
ppo
ddpg
sarsa
tutorial
double-dqn
q-learning
dueling-dqn
actor-critic
sarsa-lambda
deep-q-network
policy-gradient
machine-learning
prioritized-replay
tensorflow-tutorials
reinforcement-learning
proximal-policy-optimization
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2017-05-06
114 commits to master branch, last one 2 years ago
An elegant PyTorch deep reinforcement learning library.
Created
2018-04-16
842 commits to master branch, last one 11 days ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created
2018-06-09
4 commits to master branch, last one 4 years ago
Minimal and Clean Reinforcement Learning Examples
Created
2017-01-13
264 commits to master branch, last one 7 years ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created
2018-09-27
98 commits to master branch, last one about a year ago
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Created
2017-10-02
2,569 commits to master branch, last one 2 years ago
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created
2017-10-17
100 commits to master branch, last one 3 years ago
Scalable, event-driven, deep-learning-friendly backtesting library
a3c
unreal
backtrader
openai-gym
tensorflow
time-series
hacktoberfest
gym-environment
policy-gradient
algoritmic-trading
quantitive-finance
policy-optimisation
statistical-arbitrage
advantage-actor-critic
reinforcement-learning
algorithmic-trading-library
deep-reinforcement-learning
backtesting-trading-strategies
Created
2017-05-24
635 commits to master branch, last one 3 years ago
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
ppo
drqn
trpo
sarsa
double-dqn
openai-gym
q-learning
dueling-dqn
monte-carlo
deep-q-network
policy-gradient
policy-gradients
reinforcement-learning
deep-learning-algorithms
deep-recurrent-q-network
markov-decision-processes
deep-reinforcement-learning
hindsight-experience-replay
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2018-06-11
44 commits to master branch, last one 4 years ago
Deep Reinforcement Learning For Sequence to Sequence Models
Created
2018-05-24
181 commits to master branch, last one 5 years ago
A curated list of Monte Carlo tree search papers with implementations.
Created
2019-11-22
112 commits to master branch, last one 9 months ago
Structural implementation of RL key algorithms
Created
2018-12-10
184 commits to master branch, last one 2 years ago
DEEp Reinforcement learning framework
Created
2016-01-21
489 commits to master branch, last one 7 months ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created
2020-10-02
23 commits to master branch, last one 3 years ago
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Created
2017-12-21
703 commits to master branch, last one 5 years ago
Reinforcement learning tutorials
Created
2020-01-13
84 commits to master branch, last one 2 years ago
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Created
2019-07-30
24 commits to master branch, last one 3 years ago
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
dqn
aaai
icml
ijcai
neurips
rl-papers
q-learning
policy-gradient
imitation-learning
reinforcement-learning
artificial-intelligence
deep-reinforcement-learning
meta-reinforcement-learning
reinforcement-learning-paper
reinforcement-learning-papers
offline-reinforcement-learning
multi-agent-reinforcement-learning
reinforcement-learning-conferences
hierarchical-reinforcement-learning
reinforcement-learning-conferences-papers
Created
2023-01-05
117 commits to main branch, last one 6 months ago
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created
2018-09-26
32 commits to master branch, last one 2 months ago
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Created
2020-06-03
799 commits to master branch, last one 7 months ago
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Created
2016-07-12
31 commits to master branch, last one 6 months ago
"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
Created
2020-06-24
89 commits to master branch, last one 3 years ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created
2022-05-04
9 commits to main branch, last one 6 months ago
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
Created
2023-10-17
3 commits to master branch, last one about a year ago
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems...
Created
2021-04-06
53 commits to main branch, last one 3 years ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created
2020-05-04
89 commits to master branch, last one 3 years ago
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Created
2020-06-13
38 commits to master branch, last one 4 years ago
Baseline implementation of recurrent PPO using truncated BPTT
Created
2021-06-07
13 commits to main branch, last one about a year ago
Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
Created
2023-01-07
16 commits to main branch, last one 8 months ago