4 results found Sort:

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Created 2024-02-14
196 commits to main branch, last one 2 months ago
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Created 2023-03-20
103 commits to main branch, last one 14 days ago
Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M context keypass retrieval
Created 2024-04-13
32 commits to main branch, last one about a month ago
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
Created 2021-10-20
20 commits to main branch, last one 2 years ago