3 results found Sort:
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and effi...
Created
2017-09-27
14 commits to master branch, last one 3 months ago
A tour of different optimization algorithms in PyTorch.
Created
2020-11-14
22 commits to main branch, last one 3 years ago
Notes about LLaMA 2 model
Created
2023-08-21
4 commits to main branch, last one about a year ago