6 results found Sort:

812
11.8k
apache-2.0
136
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
Created 2021-08-08
614 commits to main branch, last one 20 hours ago
Explorations into the recently proposed Taylor Series Linear Attention
Created 2023-12-23
33 commits to main branch, last one 4 months ago
Implementation of Agent Attention in Pytorch
Created 2023-12-18
19 commits to main branch, last one 5 months ago
The semantic segmentation of remote sensing images
Created 2020-08-25
36 commits to master branch, last one about a year ago
CUDA implementation of autoregressive linear attention, with all the latest research findings
Created 2023-02-07
4 commits to main branch, last one about a year ago
2
41
agpl-3.0
3
The semantic segmentation of remote sensing images
Created 2020-11-28
18 commits to main branch, last one about a year ago