5 results found Sort:

An index of algorithms for offline reinforcement learning (offline-rl)
Created 2020-12-04
320 commits to main branch, last one 10 months ago
89
648
apache-2.0
88
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Created 2020-06-16
1,022 commits to master branch, last one 2 years ago
Implementations and examples of common offline policy evaluation methods in Python.
Created 2020-03-10
152 commits to master branch, last one 2 years ago
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
Created 2020-12-18
800 commits to main branch, last one about a year ago