54 results found Sort:

pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created 2020-12-11
324 commits to main branch, last one 2 months ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2021-09-15
1,600 commits to main branch, last one 7 months ago
258
3.3k
apache-2.0
29
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created 2021-07-13
1,586 commits to main branch, last one 5 months ago
298
2.1k
apache-2.0
12
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Created 2023-12-01
1,246 commits to main branch, last one 10 hours ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created 2020-11-23
86 commits to main branch, last one 2 years ago
158
1.6k
other
23
Turn any computer or edge device into a command center for your computer vision projects.
Created 2023-07-31
6,371 commits to main branch, last one a day ago
322
1.2k
apache-2.0
11
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created 2021-08-30
801 commits to develop branch, last one 2 years ago
A paper list of some recent Transformer-based CV works.
Created 2021-04-14
2,731 commits to main branch, last one 21 hours ago
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created 2021-01-23
63 commits to main branch, last one 2 years ago
65
780
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 8 months ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
100 commits to master branch, last one about a month ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created 2021-10-07
4 commits to master branch, last one 3 years ago
SimpleAICV:pytorch training and testing examples.
Created 2020-05-31
95 commits to master branch, last one 9 days ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created 2020-10-03
38 commits to main branch, last one 3 years ago
84
291
gpl-3.0
6
FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
Created 2016-03-04
744 commits to master branch, last one 4 months ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created 2021-05-27
30 commits to main branch, last one 2 years ago
65
281
apache-2.0
9
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
Created 2021-01-28
127 commits to main branch, last one about a year ago
10
259
apache-2.0
12
Official Code of Paper "Reversible Column Networks" "RevColv2"
Created 2022-12-22
17 commits to main branch, last one about a year ago
9
231
apache-2.0
3
MoH: Multi-Head Attention as Mixture-of-Head Attention
Created 2024-10-08
19 commits to main branch, last one 4 months ago
10
224
mit
6
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created 2023-09-28
18 commits to main branch, last one about a year ago
📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉
Created 2024-01-14
66 commits to main branch, last one 2 months ago
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Created 2021-08-12
75 commits to main branch, last one 2 years ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2022-03-13
86 commits to main branch, last one about a year ago
A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
Created 2022-02-19
9 commits to main branch, last one 3 months ago
Open source implementation of "Vision Transformers Need Registers"
Created 2023-10-04
21 commits to main branch, last one about a year ago
reproduction of semantic segmentation using masked autoencoder (mae)
Created 2022-02-03
3 commits to main branch, last one 3 years ago
34
151
apache-2.0
19
Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
Created 2019-12-13
172 commits to master branch, last one about a year ago
10
139
apache-2.0
2
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created 2022-07-05
10 commits to main branch, last one 2 years ago
Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch, with detailed steps. Tested on common datasets like MNIST, CIFAR10, and more.
Created 2021-10-14
170 commits to main branch, last one 2 months ago