54 results found Sort:

pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created 2020-12-11
324 commits to main branch, last one about a month ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2021-09-15
1,600 commits to main branch, last one 6 months ago
255
3.3k
apache-2.0
30
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created 2021-07-13
1,586 commits to main branch, last one 4 months ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created 2020-11-23
86 commits to main branch, last one 2 years ago
266
1.8k
apache-2.0
11
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Created 2023-12-01
1,188 commits to main branch, last one a day ago
144
1.5k
other
23
Turn any computer or edge device into a command center for your computer vision projects.
Created 2023-07-31
5,944 commits to main branch, last one 12 hours ago
323
1.2k
apache-2.0
11
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created 2021-08-30
801 commits to develop branch, last one 2 years ago
A paper list of some recent Transformer-based CV works.
Created 2021-04-14
2,585 commits to main branch, last one 3 days ago
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created 2021-01-23
63 commits to main branch, last one 2 years ago
63
774
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 7 months ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
100 commits to master branch, last one 17 days ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created 2021-10-07
4 commits to master branch, last one 3 years ago
SimpleAICV:pytorch training and testing examples.
Created 2020-05-31
93 commits to master branch, last one 2 days ago
84
291
gpl-3.0
6
FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
Created 2016-03-04
744 commits to master branch, last one 3 months ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created 2020-10-03
38 commits to main branch, last one 3 years ago
65
279
apache-2.0
10
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
Created 2021-01-28
127 commits to main branch, last one about a year ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created 2021-05-27
30 commits to main branch, last one about a year ago
10
257
apache-2.0
12
Official Code of Paper "Reversible Column Networks" "RevColv2"
Created 2022-12-22
17 commits to main branch, last one about a year ago
10
212
mit
6
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created 2023-09-28
18 commits to main branch, last one about a year ago
7
201
apache-2.0
3
MoH: Multi-Head Attention as Mixture-of-Head Attention
Created 2024-10-08
19 commits to main branch, last one 3 months ago
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Created 2021-08-12
75 commits to main branch, last one 2 years ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2022-03-13
86 commits to main branch, last one about a year ago
📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉
Created 2024-01-14
66 commits to main branch, last one about a month ago
A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
Created 2022-02-19
9 commits to main branch, last one 2 months ago
Open source implementation of "Vision Transformers Need Registers"
Created 2023-10-04
21 commits to main branch, last one about a year ago
reproduction of semantic segmentation using masked autoencoder (mae)
Created 2022-02-03
3 commits to main branch, last one 3 years ago
35
151
apache-2.0
20
Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
Created 2019-12-13
172 commits to master branch, last one about a year ago
10
137
apache-2.0
2
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created 2022-07-05
10 commits to main branch, last one 2 years ago
Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch, with detailed steps. Tested on common datasets like MNIST, CIFAR10, and more.
Created 2021-10-14
170 commits to main branch, last one about a month ago