3 results found

Implementation of Block Recurrent Transformer - Pytorch
Created 2023-02-07
65 commits to main branch, last one 6 months ago
Implementation of Flash Attention in Jax
Created 2022-07-12
54 commits to main branch, last one about a year ago
1
29
apache-2.0
2
[ICLR 2025] TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Created 2024-07-16
56 commits to main branch, last one 5 months ago