4 results found Sort:
Emu Series: Generative Multimodal Models from BAAI
Created
2023-07-11
41 commits to main branch, last one 2 months ago
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...
tutorial
awesome-list
image-text-matching
large-vision-models
vision-and-language
image-text-retrieval
large-language-model
video-text-retrieval
cross-modal-retrieval
large-language-models
multimodal-pretraining
video-text-recognition
memory-efficient-tuning
text-to-image-synthesis
text-to-image-generation
text-to-video-generation
visual-semantic-embedding
large-vision-language-models
parameter-efficient-fine-tuning
multimodal-large-language-models
Created
2020-12-22
130 commits to main branch, last one 9 days ago
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Created
2023-06-06
18 commits to main branch, last one 11 months ago
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Created
2023-05-22
4 commits to main branch, last one about a year ago