30 results found Sort:

287
5.2k
apache-2.0
86
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Created 2024-09-15
111 commits to main branch, last one 2 days ago
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created 2017-12-01
220 commits to master branch, last one 6 months ago
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Created 2017-12-21
49 commits to master branch, last one 7 months ago
120
1.1k
apache-2.0
13
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
187 commits to main branch, last one 2 days ago
Easily train AlphaZero-like agents on any environment you want!
Created 2023-12-14
47 commits to main branch, last one 10 months ago
34
342
unknown
12
MCTS project for Tetris
This repository has been archived (exclude archived)
Created 2018-02-20
426 commits to master branch, last one about a month ago
40
266
apache-2.0
19
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Created 2020-06-22
1,334 commits to master branch, last one 4 months ago
42
248
gpl-3.0
21
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
Created 2018-09-09
1,530 commits to master branch, last one about a month ago
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Created 2019-02-04
80 commits to master branch, last one about a year ago
11
205
unknown
3
Visualization of MCTS algorithm applied to Tic-tac-toe.
Created 2020-11-18
11 commits to master branch, last one 3 years ago
25
156
mit
8
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
Reinforcement learning models in ViZDoom environment
Created 2017-01-30
103 commits to master branch, last one 2 years ago
20
105
gpl-3.0
13
Allie: A UCI compliant chess engine
Created 2019-02-13
340 commits to master branch, last one 3 years ago
Research project: create a chess engine using Deep Reinforcement Learning
Created 2021-10-25
133 commits to main branch, last one 4 months ago
11
89
gpl-3.0
2
AlphaZero based engine for the game of Go (圍棋/围棋).
Created 2021-04-02
894 commits to master branch, last one about a month ago
7
84
apache-2.0
2
fast + parallel AlphaZero in JAX
Created 2023-03-09
625 commits to main branch, last one 8 months ago
The decision-making of multiple vehicles at intersection bases on level-k game and MCTS
Created 2024-02-07
51 commits to main branch, last one 21 days ago
Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
Created 2021-12-02
51 commits to main branch, last one about a year ago
18
72
unknown
6
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
432 commits to main branch, last one about a month ago
81
70
bsd-2-clause
10
Computer go engine using Monte-Carlo Tree Search (MCTS)
Created 2016-09-12
359 commits to master branch, last one 2 months ago
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a p...
Created 2019-10-10
29 commits to master branch, last one 8 months ago
Quoridor AI based on Monte Carlo tree search
Created 2019-06-21
208 commits to main branch, last one 6 months ago
基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。
Created 2021-08-10
134 commits to master branch, last one 3 months ago
11
56
apache-2.0
4
Computer go engine using Monte-Carlo Tree Search written in Python3.
Created 2023-01-22
152 commits to main branch, last one 7 months ago
6
43
apache-2.0
3
A rust implementation of AlphaZero algorithm
Created 2021-05-27
123 commits to main branch, last one 2 years ago
6
42
unknown
1
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
Created 2023-05-07
7 commits to main branch, last one 11 months ago
4
39
gpl-3.0
5
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created 2021-06-17
144 commits to main branch, last one 2 years ago
A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it I apply it to the game of Quoridor.
Created 2021-07-29
42 commits to main branch, last one 3 years ago
MasterAI is an AI poker dedicated to suport n-play (single- or multi-agent) Texas Hold'em imperfect-informatin games.。MasterAI v2.0是从MasterAI v1.0衍生出来的迭代算法,它在非完全信息游戏中利用了通用的强化学习+搜索,并在一对一无限押注的德州扑克中实现了超人...
Created 2021-09-22
9 commits to main branch, last one 11 hours ago