4 results found Sort:

A collection of memory efficient attention operators implemented in the Triton language.
Created 2023-10-07
47 commits to main branch, last one 9 months ago
Triton implementation of FlashAttention2 that adds Custom Masks.
Created 2024-07-20
18 commits to main branch, last one 7 months ago
Triton implement of bi-directional (non-causal) linear attention
Created 2024-11-20
32 commits to main branch, last one about a month ago
VIT inference in triton because, why not?
Created 2024-04-15
34 commits to main branch, last one 9 months ago