1 result found Sort:
Reinforcement Learning Short Course
offline-rl
q-learning
ridesharing
deep-q-network
model-based-rl
policy-gradient
value-iteration
policy-iteration
fitted-q-iteration
dynamic-programming
monte-carlo-methods
policy-based-method
off-policy-evaluation
reinforcement-learning
markov-decision-processes
order-dispatch-recommendation
temporal-differencing-learning
Created
2023-02-07
89 commits to main branch, last one 3 days ago