98 results found Sort:

1.7k
8.4k
other
78
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Created 2020-07-03
1,164 commits to master branch, last one 2 months ago
1.1k
7.5k
mit
91
An elegant PyTorch deep reinforcement learning library.
Created 2018-04-16
729 commits to master branch, last one 8 days ago
543
4.6k
other
34
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Created 2019-06-07
827 commits to master branch, last one 2 months ago
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Created 2018-08-28
146 commits to master branch, last one 4 years ago
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Created 2018-03-25
166 commits to master branch, last one about a year ago
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 4 years ago
Massively Parallel Deep Reinforcement Learning. 🔥
Created 2019-07-12
2,341 commits to master branch, last one 26 days ago
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 2 years ago
Modularized Implementation of Deep RL Algorithms in PyTorch
Created 2017-04-20
480 commits to master branch, last one about a month ago
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Created 2019-04-23
91 commits to master branch, last one about a year ago
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Created 2018-09-27
98 commits to master branch, last one 5 months ago
262
1.2k
mit
49
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Created 2017-10-02
2,569 commits to master branch, last one 2 years ago
This is the official implementation of Multi-Agent PPO (MAPPO).
Created 2021-02-23
126 commits to main branch, last one 3 months ago
184
1.1k
mit
27
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Created 2017-10-17
100 commits to master branch, last one 3 years ago
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Created 2019-10-02
9 commits to master branch, last one 2 years ago
Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Created 2021-11-14
86 commits to main branch, last one 3 months ago
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
Created 2020-09-04
48 commits to master branch, last one 9 months ago
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
Created 2018-01-13
25 commits to master branch, last one 3 years ago
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
Created 2018-04-11
497 commits to master branch, last one about a year ago
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Created 2019-04-07
995 commits to master branch, last one 2 years ago
30
595
apache-2.0
6
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Created 2023-12-03
111 commits to main branch, last one a day ago
140
585
apache-2.0
19
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Created 2020-03-09
96 commits to master branch, last one 4 years ago
47
582
apache-2.0
12
Really Fast End-to-End Jax RL Implementations
Created 2023-02-25
30 commits to main branch, last one 3 months ago
59
535
apache-2.0
9
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
Created 2023-05-25
455 commits to main branch, last one a day ago