Search Results - RepositoryStats

633

2.6k

mit

74

MuZero

rl gym mcts muzero alphago python3 pytorch alphazero tensorboard deep-learning self-learning model-based-rl muzero-general neural-network machine-learning residual-network reinforcement-learning monte-carlo-tree-search deep-reinforcement-learning

Created 2019-12-27

132 commits to master branch, last one 2 years ago

LightZero opendilab

146

1.3k

apache-2.0

11

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Created 2022-10-08

203 commits to main branch, last one a day ago

xingtian huawei-noah

89

311

mit

12

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

dqn ppo qmix impala muzero reinforcement-learning-algorithms

Created 2020-08-15

93 commits to master branch, last one 3 years ago

MuZero johan-gras

54

207

unknown

10

A structured implementation of MuZero

muzero tensorflow world-models reinforcement-learning

Created 2019-12-08

13 commits to master branch, last one 5 years ago

muzero kaesve

26

157

mit

7

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

tf2 mcts muzero alphazero tensorflow tensorflow2 deep-learning reinforcement-learning deep-reinforcement-learning

Created 2020-09-12

210 commits to master branch, last one 3 years ago

computer-go-dataset yenw

40

151

unknown

17

datasets for computer go

go sgf tygem golaxy minigo muzero alphago fineart alphazero leelazero phoenixgo elf-opengo computer-go computer-go-dataset

Created 2017-04-12

440 commits to master branch, last one 9 months ago

minizero rlglab

22

84

unknown

7

MiniZero: An AlphaZero and MuZero Training Framework

go hex mcts nogo atari gomoku muzero othello alphazero tictactoe killall-go board-games gumbel-muzero gumbel-alphazero outer-open-gomoku reinforcement-learning monte-carlo-tree-search deep-reinforcement-learning

Created 2023-10-16

442 commits to main branch, last one 28 days ago

Stochastic-muzero DHDev0

10

64

gpl-3.0

4

Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...

rl lstm muzero pytorch resnetv2 transformer arxiv-papers gym-environments machine-learning muzero-stochastic stochastic-muzero multilayer-perceptron monte-carlo-tree-search deep-reinforcement-learning online-reinforcement-learning offline-reinforcement-learning

Created 2023-01-17

26 commits to main branch, last one about a year ago

jax_muzero Hwhitetooth

7

56

mit

3

An implementation of MuZero in JAX.

jax muzero deep-learning reinforcement-learning deep-reinforcement-learning model-based-reinforcement-learning

Created 2022-02-21

11 commits to main branch, last one 3 years ago

omega hr0nix

4

40

gpl-3.0

4

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

jax flax mcts rlax muzero nethack minihack model-based-rl reinforcement-learning model-based-reinforcement-learning

Created 2021-06-17

144 commits to main branch, last one 2 years ago

muzero-cpp tuero

9

36

apache-2.0

5

A C++ pytorch implementation of MuZero

cpp mcts muzero pytorch libtorch alphazero machine-learning reinforcement-learning

Created 2021-11-24

34 commits to master branch, last one 10 months ago

Muzero-unplugged DHDev0

2

27

gpl-3.0

2

Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations...

rl gym lstm arxiv muzero python3 pytorch resnetv1 resnetv2 transformer arxiv-papers deep-learning neural-network gym-environments machine-learning muzero-unplugged reinforcement-learning monte-carlo-tree-search deep-reinforcement-learning

Created 2023-01-07

44 commits to main branch, last one 2 years ago