1 result found Sort:
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
Created
2024-03-04
15 commits to main branch, last one 9 months ago