10 results found Sort:
- Filter by Primary Language:
- Python (7)
- Jupyter Notebook (2)
- Java (1)
- +
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Created
2017-12-01
220 commits to master branch, last one 7 months ago
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
drl
r2d2
smac
atari
impala
mujoco
python
minigrid
self-play
offline-rl
pytorch-rl
distributed-system
imitation-learning
reinforcement-learning
exploration-exploitation
inverse-reinforcement-learning
multiagent-reinforcement-learning
reinforcement-learning-algorithms
distributed-reinforcement-learning
model-based-reinforcement-learning
Created
2021-07-04
835 commits to main branch, last one 2 days ago
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Created
2021-07-04
71 commits to main branch, last one 7 months ago
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Created
2022-10-08
190 commits to main branch, last one 11 days ago
The official implementation of Self-Play Fine-Tuning (SPIN)
Created
2024-02-04
100 commits to main branch, last one 7 months ago
The official implementation of Self-Play Preference Optimization (SPPO)
Created
2024-06-13
27 commits to main branch, last one 28 days ago
A Massively Parallel Large Scale Self-Play Framework
Created
2022-08-17
39 commits to main branch, last one about a year ago
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Created
2019-07-20
309 commits to master branch, last one 4 years ago
Train a neural network to PvP in Old School RuneScape using reinforcement learning.
Created
2024-01-16
572 commits to master branch, last one 10 months ago
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
Created
2021-02-10
653 commits to master branch, last one 6 months ago