4 results found Sort:

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-playe...
Created 2016-11-17
1,772 commits to master branch, last one 5 months ago
22
256
mit
9
A hyperparameter optimization framework, inspired by Optuna.
Created 2019-07-24
798 commits to main branch, last one 3 months ago
25
54
unlicense
5
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Created 2018-05-10
710 commits to master branch, last one 6 years ago