55 results found

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Created 2022-12-09
128 commits to main branch, last one 10 months ago
A comprehensive paper list of Vision Transformer/Attention, including papers, code, and related websites
Created 2021-09-15
1,600 commits to main branch, last one 3 months ago
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
Created 2023-01-27
72 commits to main branch, last one about a year ago
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Created 2022-09-09
554 commits to main branch, last one 12 days ago
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Created 2023-02-10
63 commits to main branch, last one 11 months ago
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Created 2022-09-29
42 commits to main branch, last one 6 months ago
Implementation of Alphafold 3 in Pytorch
Created 2024-05-08
990 commits to main branch, last one 3 days ago
Awesome List of Attention Modules and Plug&Play Modules in Computer Vision
Created 2021-01-10
110 commits to main branch, last one about a year ago
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Created 2023-01-03
84 commits to main branch, last one 8 months ago
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
Created 2022-09-29
147 commits to main branch, last one 3 months ago
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Created 2023-11-29
295 commits to main branch, last one about a month ago
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Created 2023-07-06
181 commits to master branch, last one 10 months ago
PyTorch Dual-Attention LSTM-Autoencoder For Multivariate Time Series
Created 2020-06-05
46 commits to master branch, last one 25 days ago
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Created 2023-05-15
35 commits to main branch, last one 2 months ago
Implementation of MagViT2 Tokenizer in Pytorch
Created 2023-10-10
186 commits to main branch, last one about a month ago
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group
Created 2023-10-11
47 commits to main branch, last one 6 months ago
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Created 2023-09-09
62 commits to main branch, last one 3 months ago
Official PyTorch Implementation for "Rotate to Attend: Convolutional Triplet Attention Module." [WACV 2021]
Created 2020-04-14
98 commits to master branch, last one 3 years ago
🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥
Created 2023-05-26
218 commits to master branch, last one about a year ago
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
Created 2023-04-24
69 commits to main branch, last one a day ago
Implementation of RT1 (Robotic Transformer) in Pytorch
Created 2022-12-13
39 commits to main branch, last one about a month ago
An implementation of local windowed attention for language modeling
Created 2020-07-05
61 commits to master branch, last one 2 months ago
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
Created 2023-09-20
113 commits to main branch, last one 5 days ago
Implementation of ChatGPT, but tailored towards primary care medicine, with the reward being able to collect patient histories in a thorough and efficient manner and come up with a reasonable differen...
Created 2022-12-10
15 commits to main branch, last one about a year ago
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Created 2024-05-04
22 commits to main branch, last one 2 months ago
Implementation of the Equiformer, SE3/E3 equivariant attention network that reaches new SOTA, and adopted for use by EquiFold for protein folding
Created 2022-10-29
171 commits to main branch, last one 3 days ago
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Created 2023-03-20
104 commits to main branch, last one 2 months ago
Implementation of Block Recurrent Transformer - Pytorch
Created 2023-02-07
65 commits to main branch, last one 3 months ago
Learning the YOLOv3 code from scratch
Created 2020-01-15
2,087 commits to master branch, last one 2 years ago
Implementation of fused cosine similarity attention in the same style as Flash Attention
Created 2022-08-04
297 commits to main branch, last one about a year ago