24 results found
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
This repository has been archived
Created
2022-09-23
44 commits to main branch, last one about a year ago
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Created
2021-01-16
133 commits to main branch, last one 2 years ago
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
Topics: dqn, aaai, icml, ijcai, neurips, rl-papers, q-learning, policy-gradient, imitation-learning, reinforcement-learning, artificial-intelligence, deep-reinforcement-learning, meta-reinforcement-learning, reinforcement-learning-paper, reinforcement-learning-papers, offline-reinforcement-learning, multi-agent-reinforcement-learning, reinforcement-learning-conferences, hierarchical-reinforcement-learning, reinforcement-learning-conferences-papers
Created
2023-01-05
117 commits to main branch, last one 6 months ago
An elegant PyTorch offline reinforcement learning library for researchers.
Created
2022-07-16
77 commits to main branch, last one 8 months ago
A Japanese (Riichi) Mahjong AI Framework
Created
2021-08-05
339 commits to main branch, last one 10 months ago
Minimal PyTorch implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling, for MuJoCo control tasks in OpenAI Gym
Created
2022-02-12
30 commits to master branch, last one 2 years ago
A collection of offline reinforcement learning algorithms.
Created
2021-02-18
148 commits to master branch, last one 25 days ago
Datasets with baselines for offline multi-agent reinforcement learning.
Created
2022-11-08
556 commits to main branch, last one about a month ago
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
Created
2021-08-23
35 commits to main branch, last one about a year ago
Clean single-file implementation of offline RL algorithms in JAX
Created
2024-01-22
354 commits to main branch, last one 3 days ago
Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets
Created
2021-02-07
19 commits to benchmark branch, last one about a month ago
Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023): https://arxiv.org/abs/2208.10291
Created
2022-08-23
14 commits to main branch, last one about a year ago
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
Created
2023-10-29
31 commits to master branch, last one 2 days ago
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
Created
2021-10-10
57 commits to main branch, last one 2 years ago
PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...
Created
2023-01-17
26 commits to main branch, last one about a year ago
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Created
2022-07-29
19 commits to main branch, last one about a year ago
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
This repository has been archived
Created
2023-05-07
27 commits to public-release branch, last one about a year ago
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
This repository has been archived
Created
2023-01-30
3 commits to main branch, last one about a year ago
Code for FOCAL Paper Published at ICLR 2021
Created
2020-10-02
27 commits to master branch, last one about a year ago
Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch
Created
2022-11-28
13 commits to main branch, last one about a year ago
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
Created
2023-10-08
13 commits to main branch, last one 5 months ago
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Created
2022-06-13
12 commits to main branch, last one about a year ago
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
Created
2023-10-13
12 commits to master branch, last one 9 months ago
A Production Tool for Embodied AI
Created
2023-08-14
131 commits to main branch, last one 5 months ago