11 results found Sort:

119
1.1k
apache-2.0
12
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
182 commits to main branch, last one 2 days ago
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Created 2020-08-15
93 commits to master branch, last one 2 years ago
54
206
unknown
10
A structured implementation of MuZero
Created 2019-12-08
13 commits to master branch, last one 4 years ago
25
156
mit
8
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
datasets for computer go
Created 2017-04-12
440 commits to master branch, last one 4 months ago
18
72
unknown
6
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
432 commits to main branch, last one 27 days ago
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...
Created 2023-01-17
26 commits to main branch, last one about a year ago
An implementation of MuZero in JAX.
Created 2022-02-21
11 commits to main branch, last one 2 years ago
4
39
gpl-3.0
5
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created 2021-06-17
144 commits to main branch, last one 2 years ago
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations...
Created 2023-01-07
44 commits to main branch, last one about a year ago