3 results found Sort:

56
956
apache-2.0
13
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Created 2023-02-21
286 commits to main branch, last one 2 months ago
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
Created 2022-12-13
54 commits to master branch, last one 7 months ago
Using Segment-Anything and CLIP to generate pixel-aligned semantic features.
Created 2023-04-20
2 commits to main branch, last one about a year ago