5 results found Sort:

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Created 2024-02-14
221 commits to main branch, last one about a month ago
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Created 2023-03-20
104 commits to main branch, last one 3 months ago
Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M context keypass retrieval
Created 2024-04-13
32 commits to main branch, last one 7 months ago
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
Created 2021-10-20
20 commits to main branch, last one 3 years ago
The official PyTorch implementation for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration, TMLR'24.
Created 2024-05-07
8 commits to main branch, last one 2 months ago