8 results found Sort:

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Created 2018-06-09
4 commits to master branch, last one 4 years ago
72
388
unknown
5
Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"
Created 2021-05-02
10 commits to main branch, last one 5 months ago
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
Created 2021-04-08
4 commits to master branch, last one 3 years ago
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
Created 2020-02-20
58 commits to master branch, last one 3 years ago
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Created 2020-06-13
38 commits to master branch, last one 3 years ago
强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行
Created 2021-12-01
193 commits to main branch, last one 8 months ago
Causal RL: Reverse-Environment Network Integrated Actor-Critic Algorithm
Created 2023-09-27
693 commits to release-v2.0 branch, last one about a month ago