35 results found

364
6.6k
apache-2.0
107
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Created 2024-09-15
238 commits to main branch, last one 3 days ago
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created 2017-12-01
221 commits to master branch, last one 2 months ago
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Created 2017-12-21
49 commits to master branch, last one 10 months ago
145
1.3k
apache-2.0
11
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
200 commits to main branch, last one 4 days ago
Latest Advances on System-2 Reasoning
Created 2025-02-09
46 commits to main branch, last one a day ago
Easily train AlphaZero-like agents on any environment you want!
Created 2023-12-14
47 commits to main branch, last one about a year ago
34
345
unknown
11
MCTS project for Tetris
This repository has been archived
Created 2018-02-20
426 commits to master branch, last one 5 months ago
40
266
apache-2.0
17
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Created 2020-06-22
1,334 commits to master branch, last one 8 months ago
44
260
gpl-3.0
19
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
Created 2018-09-09
1,532 commits to master branch, last one about a month ago
12
228
unknown
2
Visualization of MCTS algorithm applied to Tic-tac-toe.
Created 2020-11-18
11 commits to master branch, last one 3 years ago
A PyTorch tutorial for DRL (Deep Reinforcement Learning)
Created 2019-02-04
80 commits to master branch, last one about a year ago
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
Created 2018-12-12
21 commits to master branch, last one 17 days ago
26
157
mit
7
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created 2020-09-12
210 commits to master branch, last one 3 years ago
12
131
gpl-3.0
8
Research project: create a chess engine using Deep Reinforcement Learning
Created 2021-10-25
133 commits to main branch, last one 8 months ago
Reinforcement learning models in ViZDoom environment
Created 2017-01-30
103 commits to master branch, last one 3 years ago
The decision-making of multiple vehicles at an intersection, based on level-k game and MCTS
Created 2024-02-07
53 commits to main branch, last one about a month ago
21
105
gpl-3.0
13
Allie: A UCI compliant chess engine
Created 2019-02-13
340 commits to master branch, last one 4 years ago
10
95
gpl-3.0
3
AlphaZero-based engine for the game of Go (圍棋/围棋).
Created 2021-04-02
937 commits to master branch, last one 21 days ago
9
93
apache-2.0
1
fast + parallel AlphaZero in JAX
Created 2023-03-09
632 commits to main branch, last one 2 months ago
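Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, mainly a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code + tutorial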
Created 2021-12-02
51 commits to main branch, last one 2 years ago
22
84
unknown
7
MiniZero: An AlphaZero and MuZero Training Framework
Created 2023-10-16
442 commits to main branch, last one 20 days ago
79
76
bsd-2-clause
10
Computer Go engine using Monte-Carlo Tree Search (MCTS)
Created 2016-09-12
359 commits to master branch, last one 5 months ago
Quoridor AI based on Monte Carlo tree search
Created 2019-06-21
208 commits to main branch, last one 10 months ago
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a p...
Created 2019-10-10
29 commits to master branch, last one about a year ago
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
Created 2025-01-04
9 commits to main branch, last one 2 months ago
12
64
apache-2.0
3
Computer Go engine using Monte-Carlo Tree Search, written in Python 3.
Created 2023-01-22
178 commits to main branch, last one 3 months ago
A deep-learning-based GTP Go (圍棋/围棋) engine, with KGS guide documents and algorithm tutorials.
Created 2021-08-10
138 commits to master branch, last one 17 days ago
8
48
unknown
2
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)
Created 2023-05-07
7 commits to main branch, last one about a year ago
6
47
apache-2.0
2
A Rust implementation of the AlphaZero algorithm
Created 2021-05-27
123 commits to main branch, last one 2 years ago