2 results found

Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 10 days ago

Challenging Memory-based Deep Reinforcement Learning Agents
Created 2022-07-05
104 commits to main branch, last one about a month ago