4 results found
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Created: 2024-02-14
196 commits to main branch, last one 2 months ago
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Created: 2023-03-20
103 commits to main branch, last one 14 days ago
Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M context keypass retrieval
Created: 2024-04-13
32 commits to main branch, last one about a month ago
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
Created: 2021-10-20
20 commits to main branch, last one 2 years ago