21 results found Sort:
- Filter by Primary Language:
- Python (17)
- Julia (1)
- Jupyter Notebook (1)
- +
OpenDILab Decision AI Engine
drl
r2d2
smac
atari
impala
mujoco
python
minigrid
self-play
offline-rl
pytorch-rl
distributed-system
imitation-learning
reinforcement-learning
exploration-exploitation
inverse-reinforcement-learning
multiagent-reinforcement-learning
reinforcement-learning-algorithms
distributed-reinforcement-learning
model-based-reinforcement-learning
Created
2021-07-04
802 commits to main branch, last one 2 days ago
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Created
2022-02-01
1,426 commits to main branch, last one 15 hours ago
Library for Model Based RL
Created
2020-08-17
313 commits to main branch, last one 10 months ago
A curated list of awesome model based RL resources (continually updated)
Created
2021-12-28
37 commits to main branch, last one 11 days ago
Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).
Created
2020-11-09
27 commits to main branch, last one 9 months ago
DI-engine docs (Chinese and English)
Created
2021-07-09
324 commits to main branch, last one 23 days ago
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
iclr22
iclr23
iclr24
icml22
icml23
neurips22
neurips23
offline-rl
world-models
model-free-rl
model-based-rl
reinforcement-learning
deep-reinforcement-learning
meta-reinforcement-learning
reinforcement-learning-papers
multi-task-reinforcement-learning
model-based-reinforcement-learning
unsupervised-reinforcement-learning
generalization-reinforcement-learning
representation-reinforcement-learning
Created
2021-10-29
475 commits to main branch, last one 9 days ago
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Created
2019-01-03
32 commits to master branch, last one 4 years ago
15
116
mit
4
Unofficial Implementation of the paper "Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control", applied to gym environments
Created
2020-11-15
166 commits to master branch, last one about a year ago
(Experimental, a lot of bugs) Advanced automatic fingering generator for piano scores, determining optimal fingering using Model-Based Reinforcement Learning, written in the Julia language.
Created
2023-04-21
20 commits to main branch, last one 8 months ago
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Created
2022-08-23
14 commits to main branch, last one about a year ago
Deep active inference agents using Monte-Carlo methods
Created
2020-06-03
22 commits to master branch, last one 2 years ago
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Created
2021-03-01
149 commits to main branch, last one about a year ago
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
Created
2021-05-30
7 commits to main branch, last one 2 years ago
Adaptable tools to make reinforcement learning and evolutionary computation algorithms.
Created
2021-09-08
609 commits to main branch, last one 2 years ago
Model-based reinforcement learning in TensorFlow
Created
2021-03-15
17 commits to develop branch, last one 2 years ago
An implementation of MuZero in JAX.
Created
2022-02-21
11 commits to main branch, last one 2 years ago
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
Created
2021-06-06
10 commits to main branch, last one 2 years ago
Recall 2 Imagine, a World Model with superhuman memory. Oral (1.2%) @ ICLR 2024
Created
2024-03-25
1 commits to main branch, last one 2 months ago
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Created
2021-06-17
144 commits to main branch, last one about a year ago
The Hierarchical Intrinsically Motivated Agent (HIMA) is an algorithm that is intended to exhibit an adaptive goal-directed behavior using neurophysiological models of the neocortex, basal ganglia, an...
Created
2022-02-22
1,555 commits to main branch, last one 14 hours ago