1 result found Sort:

10
89
mit
25
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox...
Created 2023-02-21
18 commits to main branch, last one about a year ago