10 results found Sort:

89
939
apache-2.0
11
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Created 2022-10-08
148 commits to main branch, last one a day ago
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Created 2020-08-15
93 commits to master branch, last one 2 years ago
56
202
unknown
10
A structured implementation of MuZero
Created 2019-12-08
13 commits to master branch, last one 4 years ago
23
147
mit
8
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
datasets for computer go
Created 2017-04-12
440 commits to master branch, last one 17 days ago
9
55
unknown
5
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
417 commits to main branch, last one 7 days ago
An implementation of MuZero in JAX.
Created 2022-02-21
11 commits to main branch, last one 2 years ago
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...
Created 2023-01-17
26 commits to main branch, last one about a year ago
4
36
gpl-3.0
5
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created 2021-06-17
144 commits to main branch, last one about a year ago