4 results found

79
1.6k
apache-2.0
21
Emu Series: Generative Multimodal Models from BAAI
Created 2023-07-11
40 commits to main branch, last one 3 months ago
A paper list covering large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
Created 2020-12-22
128 commits to main branch, last one 24 days ago
11
267
apache-2.0
5
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Created 2023-06-06
18 commits to main branch, last one 5 months ago
17
212
apache-2.0
6
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Created 2023-05-22
4 commits to main branch, last one 11 months ago