5 results found
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
This repository has been archived
Created: 2022-09-23
44 commits to main branch, last one about a year ago
A collection of robotics simulation environments for reinforcement learning
Created: 2021-10-25
449 commits to main branch, last one 2 days ago
Clean single-file implementation of offline RL algorithms in JAX
Created: 2024-01-22
354 commits to main branch, last one 3 days ago
Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch
Created: 2022-11-28
13 commits to main branch, last one about a year ago
Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
Created: 2023-10-11
14 commits to main branch, last one 10 months ago