1 result found Sort:
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created
2021-06-17
144 commits to main branch, last one 2 years ago