28 results found
Simple Reinforcement Learning tutorials, 莫烦Python Chinese-language AI tutorials
a3c
dqn
ppo
ddpg
sarsa
tutorial
double-dqn
q-learning
dueling-dqn
actor-critic
sarsa-lambda
deep-q-network
policy-gradient
machine-learning
prioritized-replay
tensorflow-tutorials
reinforcement-learning
proximal-policy-optimization
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2017-05-06
114 commits to master branch, last one 2 years ago
High-quality single-file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Created
2019-06-07
832 commits to master branch, last one about a month ago
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created
2017-08-22
274 commits to master branch, last one 3 years ago
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Created
2023-07-30
1,010 commits to main branch, last one 21 hours ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created
2018-09-27
98 commits to master branch, last one about a year ago
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created
2017-10-17
100 commits to master branch, last one 3 years ago
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Created
2019-10-02
9 commits to master branch, last one 3 years ago
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
Created
2018-01-13
25 commits to master branch, last one 3 years ago
A PyTorch library for building deep reinforcement learning agents.
Created
2018-11-07
300 commits to develop branch, last one 9 months ago
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Created
2017-12-21
703 commits to master branch, last one 5 years ago
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
Created
2022-05-18
46 commits to main branch, last one 10 months ago
Trading Environment (OpenAI Gym) + PPO (TensorForce)
Created
2018-08-25
8 commits to master branch, last one 6 years ago
Recurrent and multi-process PyTorch implementation of the deep reinforcement learning Actor-Critic algorithms A2C and PPO
Created
2019-04-07
25 commits to master branch, last one 2 years ago
Deep Reinforcement Learning in C#
dqn
ppo
sac
gail
dotnet
cartpole
gail-ppo
multi-agent
deep-learning
machine-learning
multi-environment
soft-actor-critic
reinforcement-learning
deep-reinforcement-learning
proximal-policy-optimization
reinforcement-learning-agent
reinforcement-learning-algorithms
multi-agent-reinforcement-learning
reinforcement-learning-environments
Created
2023-08-15
77 commits to master branch, last one 2 months ago
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created
2022-05-04
9 commits to main branch, last one 6 months ago
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Created
2020-05-04
89 commits to master branch, last one 3 years ago
Proximal Policy Optimization (PPO) algorithm for Contra
Created
2019-09-06
3 commits to master branch, last one 3 years ago
Curiosity-driven Exploration by Self-supervised Prediction
Created
2018-11-23
12 commits to master branch, last one about a year ago
Baseline implementation of recurrent PPO using truncated BPTT
Created
2021-06-07
13 commits to main branch, last one about a year ago
Code for the paper "Reinforced Curriculum Learning for Autonomous Driving in CARLA" (ICIP 2021)
Created
2020-03-12
45 commits to master branch, last one 2 years ago
A Torch Based RL Framework for Rapid Prototyping of Research Papers
Created
2023-04-10
85 commits to master branch, last one 17 days ago
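A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based); all of the code has been debugged and runs.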
Created
2021-12-01
193 commits to main branch, last one about a year ago
JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
Created
2021-09-03
12 commits to main branch, last one 2 years ago
This repository contains an application using ROS2 Humble, Gazebo, OpenAI Gym and Stable Baselines3 to train reinforcement learning agents for a path planning problem.
Created
2023-02-07
64 commits to humble branch, last one about a year ago
An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Optimization, in PyTorch
Created
2020-09-27
56 commits to master branch, last one 18 days ago
TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets
ddqn
python
tf-agents
tensorflow
trading-bot
google-trends
ppo-algorithm
explainability
reward-shaping
machine-learning
google-trends-api
technical-analysis
trading-algorithms
deep-neural-networks
double-dqn-algorithm
risk-adjusted-return
technical-indicators
cryptocurrency-trading
deep-reinforcement-learning
proximal-policy-optimization
Created
2022-12-29
5 commits to main branch, last one about a year ago
🚗 3D web app that combines Proximal Policy Optimization with Three.js, enabling users to directly interact with or train AI models on a virtual racetrack.
Created
2023-09-21
101 commits to main branch, last one 9 months ago
Quantum error correction code AI-discovery with JAX
Created
2024-02-23
18 commits to main branch, last one 12 days ago