53 results found Sort:
- Filter by Primary Language:
- Python (35)
- Jupyter Notebook (9)
- JavaScript (2)
- TypeScript (1)
- +
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created
2020-12-11
323 commits to main branch, last one 24 days ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created
2021-09-15
1,600 commits to main branch, last one 5 months ago
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created
2021-07-13
1,586 commits to main branch, last one 2 months ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created
2020-11-23
86 commits to main branch, last one about a year ago
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Created
2023-12-01
1,122 commits to main branch, last one 9 hours ago
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Created
2023-07-31
5,134 commits to main branch, last one 10 days ago
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created
2021-08-30
801 commits to develop branch, last one 2 years ago
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created
2021-01-23
63 commits to main branch, last one 2 years ago
A paper list of some recent Transformer-based CV works.
Created
2021-04-14
2,436 commits to main branch, last one 8 hours ago
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created
2022-09-01
29 commits to main branch, last one 6 months ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created
2020-06-01
98 commits to master branch, last one 2 months ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created
2021-10-07
4 commits to master branch, last one 2 years ago
SimpleAICV:pytorch training and testing examples.
Created
2020-05-31
90 commits to master branch, last one 2 days ago
FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
Created
2016-03-04
744 commits to master branch, last one about a month ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created
2020-10-03
38 commits to main branch, last one 3 years ago
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
Created
2021-01-28
127 commits to main branch, last one about a year ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created
2021-05-27
30 commits to main branch, last one about a year ago
Official Code of Paper "Reversible Column Networks" "RevColv2"
Created
2022-12-22
17 commits to main branch, last one about a year ago
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created
2023-09-28
18 commits to main branch, last one 11 months ago
MoH: Multi-Head Attention as Mixture-of-Head Attention
Created
2024-10-08
19 commits to main branch, last one 2 months ago
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Created
2021-08-12
75 commits to main branch, last one about a year ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created
2022-03-13
86 commits to main branch, last one about a year ago
A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
Created
2022-02-19
9 commits to main branch, last one about a month ago
Open source implementation of "Vision Transformers Need Registers"
Created
2023-10-04
21 commits to main branch, last one 10 months ago
reproduction of semantic segmentation using masked autoencoder (mae)
Created
2022-02-03
3 commits to main branch, last one 2 years ago
Mimix: A Text Generation Tool and Pretrained Chinese Models
vit
clip
gpt-2
seq2seq
chinese-nlp
generative-qa
summarization
tag-generation
chinese-chatbot
text-similarity
essay-generation
novel-generation
poetry-generation
pretrained-models
comment-generation
question-generation
spelling-correction
product-review-generation
chinese-english-translator
product-description-generation
Created
2021-08-13
238 commits to main branch, last one 2 months ago
Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
Created
2019-12-13
172 commits to master branch, last one about a year ago
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created
2022-07-05
10 commits to main branch, last one 2 years ago
📖A curated list of Awesome Diffusion Inference Papers with codes, such as Sampling, Caching, Multi-GPUs, etc. 🎉🎉
Created
2024-01-14
58 commits to main branch, last one 11 hours ago
Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch, with detailed steps. Tested on common datasets like MNIST, CIFAR10, and more.
Created
2021-10-14
170 commits to main branch, last one 7 days ago