5 results found Sort:

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
Created 2018-04-11
497 commits to master branch, last one about a year ago
71
298
bsd-3-clause
14
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created 2018-09-26
29 commits to master branch, last one 3 months ago
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Created 2019-04-07
25 commits to master branch, last one about a year ago
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
Created 2023-08-09
18 commits to main branch, last one 5 days ago
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
Created 2023-09-06
10 commits to master branch, last one 9 months ago