46 results found
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Created
2020-12-11
306 commits to main branch, last one 8 months ago
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created
2021-09-15
1,594 commits to main branch, last one 10 days ago
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created
2021-07-13
1,576 commits to main branch, last one 4 months ago
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Created
2020-11-23
86 commits to main branch, last one about a year ago
🤖 PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Created
2021-08-30
801 commits to develop branch, last one about a year ago
[ICCV 2021] Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Created
2021-01-23
63 commits to main branch, last one about a year ago
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Created
2023-07-31
2,373 commits to main branch, last one 4 days ago
A paper list of some recent Transformer-based CV works.
Created
2021-04-14
1,583 commits to main branch, last one a day ago
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created
2022-09-01
28 commits to main branch, last one about a month ago
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks
Created
2023-12-01
488 commits to main branch, last one a day ago
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
Created
2021-10-07
4 commits to master branch, last one 2 years ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created
2020-06-01
96 commits to master branch, last one 2 months ago
SimpleAICV: PyTorch training and testing examples.
Created
2020-05-31
75 commits to master branch, last one 14 days ago
FFCS course registration made hassle-free for VITians. Search courses and visualize the timetable on the go!
Created
2016-03-04
700 commits to master branch, last one 29 days ago
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Created
2020-10-03
38 commits to main branch, last one 2 years ago
PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2
Created
2021-01-28
127 commits to main branch, last one 10 months ago
Official code for the papers "Reversible Column Networks" and "RevColv2"
Created
2022-12-22
17 commits to main branch, last one 8 months ago
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
Created
2021-05-27
30 commits to main branch, last one about a year ago
HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
Created
2021-08-12
75 commits to main branch, last one about a year ago
Mimix: A Text Generation Tool and Pretrained Chinese Models
Topics: vit, clip, gpt-2, seq2seq, chinese-nlp, generative-qa, summarization, tag-generation, chinese-chatbot, text-similarity, essay-generation, novel-generation, poetry-generation, pretrained-models, comment-generation, question-generation, spelling-correction, product-review-generation, chinese-english-translator, product-description-generation
Created
2021-08-13
235 commits to main branch, last one a day ago
Reproduction of semantic segmentation using a masked autoencoder (MAE)
Created
2022-02-03
3 commits to main branch, last one 2 years ago
Paddle Large Scale Classification Tools, supporting ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.
Created
2019-12-13
172 commits to master branch, last one 12 months ago
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Created
2022-03-13
86 commits to main branch, last one 6 months ago
A ViT-based transformer applied to multi-channel time-series EEG data for motor imagery classification
Created
2022-02-19
8 commits to main branch, last one about a year ago
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
Created
2022-07-05
10 commits to main branch, last one about a year ago
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Created
2023-09-28
18 commits to main branch, last one 3 months ago
🚀 React application framework inspired by UmiJS
Created
2021-03-24
139 commits to master branch, last one about a year ago
Open source implementation of "Vision Transformers Need Registers"
Created
2023-10-04
21 commits to main branch, last one 3 months ago
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
Created
2022-12-18
23 commits to main branch, last one about a year ago
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.
Created
2023-06-27
11 commits to main branch, last one 10 months ago