30 results found Sort:
- Filter by Primary Language:
- Python (14)
- C++ (7)
- Jupyter Notebook (5)
- JavaScript (2)
- Rust (1)
- +
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Created
2024-09-15
111 commits to main branch, last one 2 days ago
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created
2017-12-01
220 commits to master branch, last one 6 months ago
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Created
2017-12-21
49 commits to master branch, last one 7 months ago
MuZero
Created
2019-12-27
132 commits to master branch, last one 2 years ago
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created
2022-10-08
187 commits to main branch, last one 2 days ago
Easily train AlphaZero-like agents on any environment you want!
Created
2023-12-14
47 commits to main branch, last one 10 months ago
MCTS project for Tetris
This repository has been archived
(exclude archived)
Created
2018-02-20
426 commits to master branch, last one about a month ago
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Created
2020-06-22
1,334 commits to master branch, last one 4 months ago
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
Created
2018-09-09
1,530 commits to master branch, last one about a month ago
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Created
2019-02-04
80 commits to master branch, last one about a year ago
Visualization of MCTS algorithm applied to Tic-tac-toe.
Created
2020-11-18
11 commits to master branch, last one 3 years ago
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Created
2020-09-12
210 commits to master branch, last one 3 years ago
Reinforcement learning models in ViZDoom environment
Created
2017-01-30
103 commits to master branch, last one 2 years ago
Allie: A UCI compliant chess engine
Created
2019-02-13
340 commits to master branch, last one 3 years ago
Research project: create a chess engine using Deep Reinforcement Learning
Created
2021-10-25
133 commits to main branch, last one 4 months ago
AlphaZero based engine for the game of Go (圍棋/围棋).
Created
2021-04-02
894 commits to master branch, last one about a month ago
fast + parallel AlphaZero in JAX
Created
2023-03-09
625 commits to main branch, last one 8 months ago
The decision-making of multiple vehicles at intersection bases on level-k game and MCTS
Created
2024-02-07
51 commits to main branch, last one 21 days ago
Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
Created
2021-12-02
51 commits to main branch, last one about a year ago
MiniZero: An AlphaZero and MuZero Training Framework
Created
2023-10-16
432 commits to main branch, last one about a month ago
Computer go engine using Monte-Carlo Tree Search (MCTS)
Created
2016-09-12
359 commits to master branch, last one 2 months ago
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a p...
Created
2019-10-10
29 commits to master branch, last one 8 months ago
Quoridor AI based on Monte Carlo tree search
Created
2019-06-21
208 commits to main branch, last one 6 months ago
基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。
Created
2021-08-10
134 commits to master branch, last one 3 months ago
Computer go engine using Monte-Carlo Tree Search written in Python3.
Created
2023-01-22
152 commits to main branch, last one 7 months ago
A rust implementation of AlphaZero algorithm
Created
2021-05-27
123 commits to main branch, last one 2 years ago
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
Created
2023-05-07
7 commits to main branch, last one 11 months ago
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created
2021-06-17
144 commits to main branch, last one 2 years ago
A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it I apply it to the game of Quoridor.
Created
2021-07-29
42 commits to main branch, last one 3 years ago
MasterAI is an AI poker dedicated to suport n-play (single- or multi-agent) Texas Hold'em imperfect-informatin games.。MasterAI v2.0是从MasterAI v1.0衍生出来的迭代算法,它在非完全信息游戏中利用了通用的强化学习+搜索,并在一对一无限押注的德州扑克中实现了超人...
Created
2021-09-22
9 commits to main branch, last one 11 hours ago