2 results found
Clean baseline implementation of PPO using an episodic TransformerXL memory
Created 2022-05-04
9 commits to main branch, last one 4 months ago
Challenging Memory-based Deep Reinforcement Learning Agents
Created 2022-07-05
105 commits to main branch, last one 11 days ago