3 results found Sort:

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and effi...
Created 2017-09-27
14 commits to master branch, last one 3 months ago
A tour of different optimization algorithms in PyTorch.
Created 2020-11-14
22 commits to main branch, last one 3 years ago
Notes about LLaMA 2 model
Created 2023-08-21
4 commits to main branch, last one about a year ago