3 results found
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Created: 2023-02-21
293 commits to main branch, last one 2 months ago
[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training
Created: 2023-05-10
50 commits to main branch, last one 6 months ago
[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Created: 2024-03-24
8 commits to main branch, last one 2 months ago