5 results found Sort:
- Filter by Primary Language:
- Python (3)
- Jupyter Notebook (1)
- +
An index of algorithms for offline reinforcement learning (offline-rl)
Created
2020-12-04
320 commits to main branch, last one 10 months ago
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Created
2020-06-16
1,022 commits to master branch, last one 2 years ago
Implementations and examples of common offline policy evaluation methods in Python.
Created
2020-03-10
152 commits to master branch, last one 2 years ago
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
Created
2020-12-18
800 commits to main branch, last one about a year ago
Reinforcement Learning Short Course
offline-rl
q-learning
ridesharing
deep-q-network
model-based-rl
policy-gradient
value-iteration
policy-iteration
fitted-q-iteration
dynamic-programming
monte-carlo-methods
policy-based-method
off-policy-evaluation
reinforcement-learning
markov-decision-processes
order-dispatch-recommendation
temporal-differencing-learning
Created
2023-02-07
90 commits to main branch, last one 5 days ago