6 results found Sort:

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
Created 2018-04-11
497 commits to master branch, last one about a year ago
79
302
bsd-3-clause
15
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created 2018-09-26
32 commits to master branch, last one 4 months ago
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Created 2019-04-07
25 commits to master branch, last one 2 years ago
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
Created 2023-08-09
19 commits to main branch, last one 6 months ago
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
Created 2023-09-06
10 commits to master branch, last one about a year ago