5 results found
- Filter by Primary Language:
- Python (3)
- Jupyter Notebook (2)
RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
Created 2018-04-11
497 commits to master branch, last one about a year ago
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Created 2018-09-26
29 commits to master branch, last one 3 months ago
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Created 2019-04-07
25 commits to master branch, last one about a year ago
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
Created 2023-08-09
18 commits to main branch, last one 5 days ago
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
Created 2023-09-06
10 commits to master branch, last one 9 months ago