1 result found Sort:

Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
Created 2024-03-04
15 commits to main branch, last one 9 months ago