11 results found Sort:
- Filter by Primary Language:
- Python (9)
- Kotlin (1)
- +
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Created
2022-03-16
19 commits to main branch, last one 2 years ago
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...
tutorial
awesome-list
image-text-matching
large-vision-models
vision-and-language
image-text-retrieval
large-language-model
video-text-retrieval
cross-modal-retrieval
large-language-models
multimodal-pretraining
video-text-recognition
memory-efficient-tuning
text-to-image-synthesis
text-to-image-generation
text-to-video-generation
visual-semantic-embedding
large-vision-language-models
parameter-efficient-fine-tuning
multimodal-large-language-models
Created
2020-12-22
130 commits to main branch, last one about a month ago
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
Created
2023-02-24
43 commits to main branch, last one about a year ago
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
Created
2020-12-16
45 commits to main branch, last one 9 months ago
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
Created
2021-01-10
72 commits to master branch, last one about a year ago
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
Created
2020-10-22
60 commits to main branch, last one 2 years ago
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
Created
2022-03-30
4 commits to main branch, last one 11 months ago
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Created
2023-11-10
13 commits to main branch, last one 9 months ago
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
Created
2023-03-23
15 commits to main branch, last one 9 months ago
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval".
Created
2023-11-26
51 commits to main branch, last one 2 months ago
Easy wrapper for inserting LoRA layers in CLIP.
Created
2023-12-11
12 commits to main branch, last one 7 months ago