4 results found

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 3 years ago
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
Created 2019-09-12
500 commits to master branch, last one 15 days ago
Distributed K-FAC Preconditioner for PyTorch
Created 2020-02-16
665 commits to main branch, last one 6 days ago
Bayesian Low-Rank Adaptation for Large Language Models
Created 2024-01-19
41 commits to master branch, last one 9 months ago