27 results found Sort:

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created 2017-12-01
220 commits to master branch, last one about a month ago
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Created 2017-12-21
49 commits to master branch, last one 2 months ago
89
939
apache-2.0
11
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Created 2022-10-08
148 commits to main branch, last one a day ago
Easily train AlphaZero-like agents on any environment you want!
Created 2023-12-14
47 commits to main branch, last one 5 months ago
34
338
unknown
13
MCTS project for Tetris
Created 2018-02-20
420 commits to master branch, last one 3 years ago
40
260
apache-2.0
19
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Created 2020-06-22
1,333 commits to master branch, last one 6 days ago
42
231
gpl-3.0
19
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
Created 2018-09-09
1,519 commits to master branch, last one about a month ago
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Created 2019-02-04
80 commits to master branch, last one about a year ago
11
167
unknown
3
Visualization of MCTS algorithm applied to Tic-tac-toe.
Created 2020-11-18
11 commits to master branch, last one 2 years ago
23
147
mit
8
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
Reinforcement learning models in ViZDoom environment
Created 2017-01-30
103 commits to master branch, last one 2 years ago
21
104
gpl-3.0
13
Allie: A UCI compliant chess engine
Created 2019-02-13
340 commits to master branch, last one 3 years ago
Research project: create a chess engine using Deep Reinforcement Learning
Created 2021-10-25
132 commits to main branch, last one about a month ago
5
72
apache-2.0
2
fast + parallel AlphaZero in JAX
Created 2023-03-09
625 commits to main branch, last one 3 months ago
81
70
bsd-2-clause
10
Computer go engine using Monte-Carlo Tree Search (MCTS)
Created 2016-09-12
350 commits to master branch, last one 3 months ago
Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
Created 2021-12-02
51 commits to main branch, last one about a year ago
7
65
gpl-3.0
2
AlphaZero based engine for the game of Go (圍棋/围棋).
Created 2021-04-02
828 commits to master branch, last one 7 days ago
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a p...
Created 2019-10-10
29 commits to master branch, last one 3 months ago
Quoridor AI based on Monte Carlo tree search
Created 2019-06-21
208 commits to main branch, last one about a month ago
9
55
unknown
5
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
417 commits to main branch, last one 8 days ago
11
52
apache-2.0
4
Computer go engine using Monte-Carlo Tree Search written in Python3.
Created 2023-01-22
152 commits to main branch, last one 3 months ago
基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。
Created 2021-08-10
130 commits to master branch, last one 5 months ago
5
41
apache-2.0
3
A rust implementation of AlphaZero algorithm
Created 2021-05-27
123 commits to main branch, last one about a year ago
The decision-making of multiple vehicles at intersection bases on level-k game and MCTS
Created 2024-02-07
48 commits to main branch, last one about a month ago
4
36
gpl-3.0
5
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created 2021-06-17
144 commits to main branch, last one about a year ago
5
32
unknown
1
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
Created 2023-05-07
7 commits to main branch, last one 6 months ago