46 results found Sort:

pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created 2020-12-11
306 commits to main branch, last one 8 months ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2021-09-15
1,594 commits to main branch, last one 10 days ago
239
3.0k
apache-2.0
29
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created 2021-07-13
1,576 commits to main branch, last one 4 months ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created 2020-11-23
86 commits to main branch, last one about a year ago
315
1.2k
apache-2.0
10
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created 2021-08-30
801 commits to develop branch, last one about a year ago
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created 2021-01-23
63 commits to main branch, last one about a year ago
78
1.1k
other
19
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Created 2023-07-31
2,373 commits to main branch, last one 4 days ago
A paper list of some recent Transformer-based CV works.
Created 2021-04-14
1,583 commits to main branch, last one a day ago
63
728
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
28 commits to main branch, last one about a month ago
59
504
apache-2.0
8
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
Created 2023-12-01
488 commits to main branch, last one a day ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created 2021-10-07
4 commits to master branch, last one 2 years ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
96 commits to master branch, last one 2 months ago
SimpleAICV:pytorch training and testing examples.
Created 2020-05-31
75 commits to master branch, last one 14 days ago
71
285
gpl-3.0
6
FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
Created 2016-03-04
700 commits to master branch, last one 29 days ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created 2020-10-03
38 commits to main branch, last one 2 years ago
63
263
apache-2.0
12
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
Created 2021-01-28
127 commits to main branch, last one 10 months ago
10
245
apache-2.0
12
Official Code of Paper "Reversible Column Networks" "RevColv2"
Created 2022-12-22
17 commits to main branch, last one 8 months ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created 2021-05-27
30 commits to main branch, last one about a year ago
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Created 2021-08-12
75 commits to main branch, last one about a year ago
reproduction of semantic segmentation using masked autoencoder (mae)
Created 2022-02-03
3 commits to main branch, last one 2 years ago
33
144
apache-2.0
21
Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
Created 2019-12-13
172 commits to master branch, last one 12 months ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created 2022-03-13
86 commits to main branch, last one 6 months ago
A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
Created 2022-02-19
8 commits to main branch, last one about a year ago
10
134
apache-2.0
2
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created 2022-07-05
10 commits to main branch, last one about a year ago
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created 2023-09-28
18 commits to main branch, last one 3 months ago
7
100
mit
3
🚀 React application framework inspired by UmiJS / 类 UmiJS 的 React 应用框架
Created 2021-03-24
139 commits to master branch, last one about a year ago
Open source implementation of "Vision Transformers Need Registers"
Created 2023-10-04
21 commits to main branch, last one 3 months ago
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
Created 2022-12-18
23 commits to main branch, last one about a year ago
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.
Created 2023-06-27
11 commits to main branch, last one 10 months ago