10 results found Sort:

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created 2017-12-01
220 commits to master branch, last one 6 months ago
115
1.2k
apache-2.0
19
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Created 2021-07-04
71 commits to main branch, last one 6 months ago
120
1.1k
apache-2.0
13
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created 2022-10-08
187 commits to main branch, last one 2 days ago
92
1.0k
apache-2.0
12
The official implementation of Self-Play Fine-Tuning (SPIN)
Created 2024-02-04
100 commits to main branch, last one 6 months ago
62
498
apache-2.0
28
The official implementation of Self-Play Preference Optimization (SPPO)
Created 2024-06-13
26 commits to main branch, last one 4 months ago
A Massively Parallel Large Scale Self-Play Framework
Created 2022-08-17
39 commits to main branch, last one about a year ago
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Created 2019-07-20
309 commits to master branch, last one 4 years ago
Train a neural network to PvP in Old School RuneScape using reinforcement learning.
Created 2024-01-16
572 commits to master branch, last one 9 months ago
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
Created 2021-02-10
653 commits to master branch, last one 5 months ago