1 result found Sort:

My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
Created 2024-07-21
15 commits to main branch, last one about a month ago