46 results found
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Created
2017-06-14
196 commits to master branch, last one 3 years ago
A TensorFlow Implementation of the Transformer: Attention Is All You Need
Created
2017-06-17
37 commits to master branch, last one 5 years ago
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Created
2023-01-13
248 commits to master branch, last one 2 days ago
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
pytorch
seq2seq
sockeye
transformer
translation
deep-learning
attention-model
encoder-decoder
machine-learning
attention-mechanism
machine-translation
transformer-network
deep-neural-networks
sequence-to-sequence
transformer-architecture
attention-is-all-you-need
neural-machine-translation
sequence-to-sequence-models
Created
2017-06-08
836 commits to main branch, last one 13 days ago
list of efficient attention modules
This repository has been archived
Created
2020-07-31
46 commits to master branch, last one 3 years ago
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Created
2018-10-23
101 commits to master branch, last one 5 years ago
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Created
2018-11-22
23 commits to master branch, last one about a year ago
A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
Created
2018-03-16
29 commits to master branch, last one 3 years ago
Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Created
2023-07-06
181 commits to master branch, last one 10 months ago
Attention is all you need implementation
Created
2023-05-18
53 commits to main branch, last one 11 months ago
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Created
2019-10-30
2,212 commits to latest branch, last one about a year ago
A Benchmark of Text Classification in PyTorch
Created
2017-12-13
205 commits to master branch, last one 6 months ago
Neural Machine Translation with Keras
Created
2016-11-22
681 commits to master branch, last one 3 years ago
An open-source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multimodal AI that uses just a decoder to generate both text and images
Created
2023-07-14
73 commits to main branch, last one 10 months ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created
2024-03-27
198 commits to main branch, last one 21 days ago
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
Created
2024-02-08
20 commits to main branch, last one 8 months ago
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
Created
2023-12-04
27 commits to main branch, last one 5 months ago
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
Created
2023-06-09
190 commits to main branch, last one 9 months ago
Attention Is All You Need | a PyTorch Tutorial to Transformers
Created
2020-01-30
19 commits to master branch, last one 11 months ago
PyTorch implementation of "Attention Is All You Need"
Created
2018-01-05
8 commits to master branch, last one 6 years ago
Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Created
2021-12-02
15 commits to master branch, last one about a year ago
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
Created
2023-05-25
6 commits to main branch, last one about a year ago
PyTorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Created
2023-10-04
119 commits to main branch, last one 7 months ago
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Created
2024-01-13
21 commits to main branch, last one 7 months ago
[ICRA 2023] Intention Aware Robot Crowd Navigation with Attention-Based Interaction Graph
Created
2023-01-03
17 commits to main branch, last one 7 months ago
Build your own Face App with Stable Diffusion 2.1
Created
2024-06-29
54 commits to main branch, last one about a month ago
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
Created
2024-04-01
10 commits to main branch, last one 7 months ago
Transformers without Tears: Improving the Normalization of Self-Attention
This repository has been archived
Created
2019-10-15
17 commits to master branch, last one 5 months ago
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
ai
gpt
llm
nlp
natural
tokenizers
tokenization
transformers
deep-learning
language-model
machine-learning
attention-mechanisms
artificial-intelligence
transformer-architecture
attention-is-all-you-need
natural-language-inference
natural-language-processing
natural-language-procressing
natural-language-understanding
Created
2024-03-06
266 commits to master branch, last one 3 months ago
This repository has no description...
Created
2018-03-31
167 commits to master branch, last one 5 years ago