41 results found Sort:

A PyTorch implementation of the Transformer model in "Attention is All You Need".
Created 2017-06-14
196 commits to master branch, last one 3 years ago
1.3k
4.2k
apache-2.0
110
A TensorFlow Implementation of the Transformer: Attention Is All You Need
Created 2017-06-17
37 commits to master branch, last one 4 years ago
list of efficient attention modules
This repository has been archived (exclude archived)
Created 2020-07-31
46 commits to master branch, last one 2 years ago
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Created 2018-10-23
101 commits to master branch, last one 5 years ago
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Created 2018-11-22
23 commits to master branch, last one about a year ago
A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
Created 2018-03-16
29 commits to master branch, last one 2 years ago
62
655
apache-2.0
18
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Created 2023-07-06
181 commits to master branch, last one 4 months ago
A Benchmark of Text Classification in PyTorch
Created 2017-12-13
205 commits to master branch, last one about a month ago
193
580
apache-2.0
19
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Created 2019-10-30
2,212 commits to latest branch, last one about a year ago
Attention is all you need implementation
Created 2023-05-18
53 commits to main branch, last one 5 months ago
17
336
mit
21
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
Created 2023-07-14
73 commits to main branch, last one 5 months ago
pytorch implementation of Attention is all you need
Created 2018-01-05
8 commits to master branch, last one 6 years ago
Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Created 2021-12-02
15 commits to master branch, last one about a year ago
38
219
apache-2.0
3
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
Created 2023-06-09
190 commits to main branch, last one 4 months ago
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
Created 2024-02-08
20 commits to main branch, last one 3 months ago
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
Created 2023-12-04
27 commits to main branch, last one 3 days ago
Attention Is All You Need | a PyTorch Tutorial to Transformers
Created 2020-01-30
19 commits to master branch, last one 5 months ago
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
Created 2023-05-25
6 commits to main branch, last one about a year ago
Sequence Parallel Attention for Long Context LLM Model Training and Inference
Created 2024-03-27
168 commits to main branch, last one 3 days ago
[ICRA 2023] Intention Aware Robot Crowd Navigation with Attention-Based Interaction Graph
Created 2023-01-03
17 commits to main branch, last one about a month ago
Transformers without Tears: Improving the Normalization of Self-Attention
Created 2019-10-15
17 commits to master branch, last one 2 days ago
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Created 2024-01-13
21 commits to main branch, last one 2 months ago
13
122
mit
8
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Created 2023-10-04
119 commits to main branch, last one 2 months ago
This repository has no description...
Created 2018-03-31
167 commits to master branch, last one 5 years ago
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
Created 2024-04-01
10 commits to main branch, last one 2 months ago
Final Project for AI Wireless
Created 2020-12-04
83 commits to main branch, last one 2 years ago