Search Results - RepositoryStats

12 results found Sort:

Filter by Primary Language:
Python (10)
Kotlin (1)
+

GroupViT NVlabs

52

755

other

11

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

transformers zero-shot-learning image-text-matching semantic-segmentation

Created 2022-03-16

19 commits to main branch, last one 2 years ago

Awesome_Matching_Pretraining_Transfering Paranioar

48

423

mit

13

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...

tutorial awesome-list image-text-matching large-vision-models vision-and-language image-text-retrieval large-language-model video-text-retrieval cross-modal-retrieval large-language-models multimodal-pretraining video-text-recognition memory-efficient-tuning text-to-image-synthesis text-to-image-generation text-to-video-generation visual-semantic-embedding large-vision-language-models parameter-efficient-fine-tuning multimodal-large-language-models

Created 2020-12-22

130 commits to main branch, last one 3 months ago

tidy slavabarkov

28

409

gpl-3.0

7

Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine

nlp clip onnx kotlin android image-search quantization deep-learning computer-vision image-retrieval semantic-search image-text-matching image-text-retrieval cross-modal-retrieval

Created 2023-02-24

43 commits to main branch, last one about a year ago

SGRAF Paranioar

36

213

unknown

5

[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”

aaai text-matching image-retrieval similarity-metric image-text-matching image-text-retrieval cross-modal-retrieval

Created 2020-12-16

45 commits to main branch, last one 11 months ago

vse_infty woodfrog

16

160

mit

3

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)

vse pytorch vision-language visual-semantic image-text-matching cross-modal-retrieval

Created 2021-01-10

72 commits to master branch, last one 2 years ago

DSRAN kywen1119

12

72

apache-2.0

3

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

tcsvt pytorch cross-modal computer-vision image-text-matching

Created 2020-10-22

60 commits to main branch, last one 3 years ago

eccv-caption naver-ai

2

56

other

2

Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)

dataset eccv2022 evaluation vl-benchmark deep-learning machine-learning image-text-matching vision-and-language cross-modal-retrieval

Created 2022-03-30

4 commits to main branch, last one about a year ago

ComCLIP eric-ai-lab

3

35

mit

2

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

svo clip slip blip2 causality flickr30k winoground compositionality flickr8k-dataset image-text-matching vision-and-language image-text-retrieval

Created 2023-11-10

13 commits to main branch, last one 11 months ago

3

33

apache-2.0

1

[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”

tip regulator text-matching image-retrieval image-text-matching image-text-retrieval cross-modal-retrieval

Created 2023-03-23

15 commits to main branch, last one 11 months ago

CoN-CLIP jaisidhsingh

2

30

other

4

Implementation of the "Learn No to Say Yes Better" paper.

pytorch multimodal deep-learning image-captions compositionality image-text-matching visual-language-models

Created 2024-03-15

7 commits to main branch, last one 20 days ago

LoRA-CLIP jaisidhsingh

2

28

mit

1

Easy wrapper for inserting LoRA layers in CLIP.

lora multimodal image-text-matching low-rank-adaptation multimodal-deep-learning parameter-efficient-tuning vision-language-pretraining

Created 2023-12-11

12 commits to main branch, last one 9 months ago

SEMScene MartinYuanNJU

1

25

unknown

1

Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval".

scene-graph-models image-text-matching cross-modal-retrieval

Created 2023-11-26

51 commits to main branch, last one 4 months ago