4 results found Sort:
A collection of memory efficient attention operators implemented in the Triton language.
Created
2023-10-07
47 commits to main branch, last one 9 months ago
Triton implementation of FlashAttention2 that adds Custom Masks.
Created
2024-07-20
18 commits to main branch, last one 7 months ago
Triton implement of bi-directional (non-causal) linear attention
Created
2024-11-20
32 commits to main branch, last one about a month ago
VIT inference in triton because, why not?
Created
2024-04-15
34 commits to main branch, last one 9 months ago