4 results found Sort:
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created
2017-08-22
274 commits to master branch, last one 3 years ago
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
Created
2019-09-12
457 commits to master branch, last one 3 months ago
Distributed K-FAC Preconditioner for PyTorch
Created
2020-02-16
648 commits to main branch, last one 2 days ago
Bayesian Low-Rank Adaptation for Large Language Models
Created
2024-01-19
41 commits to master branch, last one 7 months ago