2 results found Sort:
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
Created
2022-09-23
26 commits to main branch, last one about a year ago
Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)
Created
2023-05-09
10 commits to main branch, last one about a year ago