52 results found

pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created 2020-12-11
306 commits to main branch, last one about a year ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2021-09-15
1,600 commits to main branch, last one 3 months ago
253
3.2k
apache-2.0
29
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created 2021-07-13
1,586 commits to main branch, last one about a month ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created 2020-11-23
86 commits to main branch, last one about a year ago
193
1.4k
apache-2.0
11
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
Created 2023-12-01
1,017 commits to main branch, last one a day ago
130
1.4k
other
23
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Created 2023-07-31
4,691 commits to main branch, last one 5 days ago
319
1.2k
apache-2.0
10
PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created 2021-08-30
801 commits to develop branch, last one 2 years ago
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created 2021-01-23
63 commits to main branch, last one 2 years ago
A paper list of some recent Transformer-based CV works.
Created 2021-04-14
2,292 commits to main branch, last one 21 hours ago
64
761
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 4 months ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
98 commits to master branch, last one 26 days ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created 2021-10-07
4 commits to master branch, last one 2 years ago
SimpleAICV: PyTorch training and testing examples.
Created 2020-05-31
85 commits to master branch, last one 5 days ago
83
291
gpl-3.0
6
FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
Created 2016-03-04
744 commits to master branch, last one 9 days ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created 2020-10-03
38 commits to main branch, last one 3 years ago
65
276
apache-2.0
11
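PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.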
Created 2021-01-28
127 commits to main branch, last one about a year ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created 2021-05-27
30 commits to main branch, last one about a year ago
10
250
apache-2.0
12
Official Code of Paper "Reversible Column Networks" "RevColv2"
Created 2022-12-22
17 commits to main branch, last one about a year ago
HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
Created 2021-08-12
75 commits to main branch, last one about a year ago
10
185
mit
7
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created 2023-09-28
18 commits to main branch, last one 9 months ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2022-03-13
86 commits to main branch, last one about a year ago
5
157
apache-2.0
3
MoH: Multi-Head Attention as Mixture-of-Head Attention
Created 2024-10-08
19 commits to main branch, last one 22 days ago
A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
Created 2022-02-19
8 commits to main branch, last one 2 years ago
Reproduction of semantic segmentation using a masked autoencoder (MAE)
Created 2022-02-03
3 commits to main branch, last one 2 years ago
35
150
apache-2.0
21
Paddle Large Scale Classification Tools, supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.
Created 2019-12-13
172 commits to master branch, last one about a year ago
Open source implementation of "Vision Transformers Need Registers"
Created 2023-10-04
21 commits to main branch, last one 9 months ago
10
137
apache-2.0
2
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created 2022-07-05
10 commits to main branch, last one 2 years ago
Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch with detailed steps. Tested on small datasets: MNIST, FashionMNIST, SVHN, CIFAR10, and CIFAR100.
Created 2021-10-14
150 commits to main branch, last one 13 days ago
21
106
apache-2.0
1
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
Created 2022-12-18
23 commits to main branch, last one about a year ago