12 results found

146
1.3k
apache-2.0
11
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
203 commits to main branch, last one a day ago
XingTian is a componentized library for the development and verification of reinforcement learning algorithms.
Created 2020-08-15
93 commits to master branch, last one 3 years ago
54
207
unknown
10
A structured implementation of MuZero
Created 2019-12-08
13 commits to master branch, last one 5 years ago
26
157
mit
7
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
Datasets for computer Go
Created 2017-04-12
440 commits to master branch, last one 9 months ago
22
84
unknown
7
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
442 commits to main branch, last one 28 days ago
PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm supports a wide range of action and observation spaces, including both discrete and continuous variation...
Created 2023-01-17
26 commits to main branch, last one about a year ago
An implementation of MuZero in JAX.
Created 2022-02-21
11 commits to main branch, last one 3 years ago
4
40
gpl-3.0
4
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in NetHack/MiniHack environments.
Created 2021-06-17
144 commits to main branch, last one 2 years ago
9
36
apache-2.0
5
A C++ PyTorch implementation of MuZero
Created 2021-11-24
34 commits to master branch, last one 10 months ago
PyTorch implementation of MuZero Unplugged for Gym environments. This algorithm supports a wide range of action and observation spaces, including both discrete and continuous variations...
Created 2023-01-07
44 commits to main branch, last one 2 years ago