2 results found Sort:

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
Created 2022-09-23
26 commits to main branch, last one about a year ago
Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)
Created 2023-05-09
10 commits to main branch, last one about a year ago