4 results found

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...
Created 2017-08-22
274 commits to master branch, last one 2 years ago
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
Created 2019-09-12
451 commits to master branch, last one about a month ago
Distributed K-FAC Preconditioner for PyTorch
Created 2020-02-16
614 commits to main branch, last one 4 days ago
Bayesian Low-Rank Adaptation for Large Language Models
Created 2024-01-19
41 commits to master branch, last one 3 months ago