88 results found
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Created
2018-07-26
342 commits to master branch, last one 25 days ago
An elegant PyTorch deep reinforcement learning library.
Created
2018-04-16
729 commits to master branch, last one 8 days ago
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
This repository has been archived
Created
2018-04-26
68 commits to master branch, last one 4 years ago
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Created
2017-12-21
49 commits to master branch, last one about a month ago
MuZero
Created
2019-12-27
132 commits to master branch, last one 2 years ago
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
This repository has been archived
Created
2017-10-01
524 commits to master branch, last one about a year ago
A modular, primitive-first, Python-first PyTorch library for Reinforcement Learning.
Created
2022-02-01
1,426 commits to main branch, last one 14 hours ago
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Created
2020-05-05
375 commits to master branch, last one 28 days ago
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Created
2017-05-15
10 commits to master branch, last one 5 years ago
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Created
2018-10-31
237 commits to master branch, last one 3 years ago
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
This repository has been archived
Created
2019-10-14
190 commits to master branch, last one about a year ago
Python library for Reinforcement Learning.
Created
2017-02-25
1,983 commits to dev branch, last one 10 days ago
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Created
2021-08-20
66 commits to master branch, last one 3 days ago
Hearthstone simulator using C++ with some reinforcement learning
Created
2017-05-19
10,891 commits to main branch, last one 2 days ago
A curated list of Monte Carlo tree search papers with implementations.
Created
2019-11-22
112 commits to master branch, last one 2 months ago
Implementation of papers in 100 lines of code.
Created
2020-10-26
97 commits to main branch, last one 28 days ago
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Created
2019-10-10
75 commits to sb3 branch, last one 11 months ago
📘 The MLOps stack component for experiment tracking
Created
2019-02-11
2,065 commits to master branch, last one 3 days ago
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow: Deep MaxEnt, MaxEnt, LPIRL
Created
2017-05-24
53 commits to master branch, last one 22 days ago
A curated list of applied machine learning and data science notebooks and libraries across different industries.
Created
2019-06-19
51 commits to master branch, last one 4 years ago
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
Created
2017-09-29
210 commits to master branch, last one 2 years ago
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Created
2020-09-20
143 commits to master branch, last one 2 months ago
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Isaac Orbit and Omniverse Isaac Gym
Created
2021-10-18
1,392 commits to main branch, last one 3 months ago
RAD: Reinforcement Learning with Augmented Data
Created
2020-04-09
23 commits to master branch, last one 3 years ago
DrQ: Data regularized Q
Created
2020-04-29
14 commits to master branch, last one 2 years ago
A universal flight control tuning framework
Created
2018-04-10
268 commits to master branch, last one 2 years ago
OpenNARS for Research 3.0+
Created
2014-07-18
3,944 commits to master branch, last one 3 years ago
PyTorch reimplementation of "Gradient Surgery for Multi-Task Learning"
Created
2020-08-25
17 commits to master branch, last one 2 years ago
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Created
2019-01-19
864 commits to master branch, last one about a year ago
Reinforcement learning theory book about foundations of deep RL algorithms with proofs.
Created
2020-10-11
70 commits to main branch, last one 5 months ago