Search Results - RepositoryStats

1.1k

13.9k

mit

72

pix2tex: Using a ViT to convert images of equations into LaTeX code.

ocr vit latex python dataset im2text pytorch im2latex math-ocr im2markup latex-ocr image2text transformer deep-learning image-processing machine-learning vision-transformer

Created 2020-12-11

324 commits to main branch, last one 2 months ago

Awesome-Transformer-Attention cmhungsteve

495

4.8k

unknown

128

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

vit detr papers transformer awesome-list transformers deep-learning self-attention transformer-cv computer-vision transformer-models vision-transformer visual-transformer attention-mechanism transformer-awesome transformer-with-cv attention-mechanisms transformer-architecture

Created 2021-09-15

1,600 commits to main branch, last one 7 months ago

towhee towhee-io

258

3.3k

apache-2.0

29

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

llm vit milvus towhee pipeline embeddings transformer feature-vector computer-vision image-retrieval image-processing machine-learning video-processing embedding-vectors unstructured-data feature-extraction vision-transformer convolutional-networks

Created 2021-07-13

1,586 commits to main branch, last one 5 months ago

VLMEvalKit open-compass

298

2.1k

apache-2.0

12

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt llm vit vqa clip gpt4 qwen llava claude gemini gpt-4v openai chatgpt pytorch evaluation openai-api multi-modal computer-vision large-language-models

Created 2023-12-01

1,246 commits to main branch, last one 10 hours ago

Transformer-Explainability hila-chefer

247

1.9k

mit

20

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

vit bert cvpr2021 bert-model perturbation deep-learning explainability attention-matrix vision-transformer attention-visualization visualize-classifications transformer-interpretability

Created 2020-11-23

86 commits to main branch, last one 2 years ago

inference roboflow

158

1.6k

other

23

Turn any computer or edge device into a command center for your computer vision projects.

Created 2023-07-31

6,371 commits to main branch, last one a day ago

PaddleViT BR-IDL

322

1.2k

apache-2.0

11

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

cv gan mlp vit detection transformer paddlepaddle segmentation deep-learning classification computer-vision encoder-decoder object-detection semantic-segmentation

Created 2021-08-30

801 commits to develop branch, last one 2 years ago

Transformer-in-Computer-Vision Yangzhangcst

142

1.2k

unknown

40

A paper list of some recent Transformer-based CV works.

vit detr papers awesome transformer deep-learning transformer-cv computer-vision transformer-awesome

Created 2021-04-14

2,731 commits to main branch, last one 21 hours ago

T2T-ViT yitu-opensource

176

1.2k

other

17

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

vit t2t-transformer vision-transformer

Created 2021-01-23

63 commits to main branch, last one 2 years ago

Adan sail-sg

65

780

apache-2.0

7

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Created 2022-09-01

29 commits to main branch, last one 8 months ago

video_features v-iashin

97

580

mit

6

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

Created 2020-06-01

100 commits to master branch, last one about a month ago

mobilevit-pytorch chinhsuanwu

73

521

mit

5

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

vit mobilevit mobilenetv2 vision-transformer

Created 2021-10-07

4 commits to master branch, last one 3 years ago

SimpleAICV_pytorch_training_examples zgcr

99

430

mit

7

SimpleAICV:pytorch training and testing examples.

kd mae sam van vit detr fcos dbnet resnet solov2 yolact pytorch dinodetr lightsam retinanet convformer deeplabv3plus segment-anything

Created 2020-05-31

95 commits to master branch, last one 9 days ago

pytorch-vit gupta-abhay

34

293

mit

8

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

vit hybrid-vit transformers image-recognition vision-transformer image-classification

Created 2020-10-03

38 commits to main branch, last one 3 years ago

FFCSonTheGo vatz88

84

291

gpl-3.0

6

FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!

vit ffcs vellore timetable javascript hacktoberfest

Created 2016-03-04

744 commits to master branch, last one 4 months ago

EEG-Transformer eeyhsong

29

286

gpl-3.0

3

i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...

eeg vit attention transformer deep-learning eeg-classification attention-mechanism physiological-signals common-spatial-pattern

Created 2021-05-27

30 commits to main branch, last one 2 years ago

PASSL PaddlePaddle

65

281

apache-2.0

9

PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision Transformer，DEiT，Swin Transformer，CvT，T2T-ViT，MLP-Mixer，XCiT，ConvNeXt，PVTv2 等基础视觉算法

cvt mae pvt vit beit clip deit moco swav xcit paddle pixpro simclr moco-v2 convnext deep-learning swin-transformer vision-transformer self-supervised-learning

Created 2021-01-28

127 commits to main branch, last one about a year ago

RevCol megvii-research

10

259

apache-2.0

12

Official Code of Paper "Reversible Column Networks" "RevColv2"

cnn mae vit pytorch iclr2023 transformer computer-vision

Created 2022-12-22

17 commits to main branch, last one about a year ago

MoH SkyworkAI

9

231

apache-2.0

3

MoH: Multi-Head Attention as Mixture-of-Head Attention

dit moe vit llms attention transformer mixture-of-experts

Created 2024-10-08

19 commits to main branch, last one 4 months ago

NaViT kyegomez

10

224

mit

6

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

vit clip gpt4 multimodal multimodality attention-mechanism multimodal-learning multimodal-deep-learning

Created 2023-09-28

18 commits to main branch, last one about a year ago

Awesome-Diffusion-Inference DefTruth

13

198

gpl-3.0

8

📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉

dit gpu vit sd15 sdxl sora deepcache diffusion inference open-sora multi-gpus open-sora-plan stable-diffusion

Created 2024-01-14

66 commits to main branch, last one 2 months ago

HugsVision qanastek

21

195

mit

2

HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

Created 2021-08-12

75 commits to main branch, last one 2 years ago

Awesome-Transformer-in-Medical-Imaging xmindflow

24

194

gpl-3.0

1

[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

vit transformer awesome-list segmentation transformers deep-learning computer-vision vision-transformer attention-mechanism medical-image-segmentation

Created 2022-03-13

86 commits to main branch, last one about a year ago

EEG-Transformer zwcolin

20

170

gpl-3.0

3

A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification

bci vit transformer eeg-classification

Created 2022-02-19

9 commits to main branch, last one 3 months ago

Vit-RGTS kyegomez

15

168

mit

5

Open source implementation of "Vision Transformers Need Registers"

vit gpt4 vision-api vision-transformer attention-mechanism

Created 2023-10-04

21 commits to main branch, last one about a year ago

mae_segmentation implus

14

161

unknown

3

reproduction of semantic segmentation using masked autoencoder (mae)

mae vit masked-autoencoder vision-transformer semantic-segmentation self-supervised-learning

Created 2022-02-03

3 commits to main branch, last one 3 years ago

mimix yaoxiaoyuan

17

155

apache-2.0

2

Mimix: A Text Generation Tool and Pretrained Chinese Models

Created 2021-08-13

238 commits to main branch, last one 4 months ago

PLSC PaddlePaddle

34

151

apache-2.0

19

Paddle Large Scale Classification Tools，supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.

Created 2019-12-13

172 commits to master branch, last one about a year ago

LightViT hunto

10

139

apache-2.0

2

Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"

vit backbone imagenet lightvit

Created 2022-07-05

10 commits to main branch, last one 2 years ago

PyTorch-Scratch-Vision-Transformer-ViT s-chh

20

131

mit

1

Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch, with detailed steps. Tested on common datasets like MNIST, CIFAR10, and more.

vit simple scratch vit-svhn vit-cifar vit-mnist vit-simple pytorch-vit transformer vit-cifar10 vit-scratch vit-fashionmnist transformer-mnist vision-transformer transformer-cifar10

Created 2021-10-14

170 commits to main branch, last one 2 months ago