Search Results - RepositoryStats

62

1.1k

apache-2.0

14

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Created 2023-02-21

298 commits to main branch, last one about a month ago

RLIPv2 JacobYuan7

3

122

apache-2.0

2

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training

detection language-vision scene-graph-generation human-object-interaction

Created 2023-05-10

50 commits to main branch, last one 8 months ago

Language-Conditioned-Affordance-Pose-Detection-in-3D-Point-Clouds Fsoft-AIC

7

30

mit

1

[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

robotics icra-2024 language-vision pose-estimation diffusion-models affordance-detection

Created 2024-03-24

9 commits to main branch, last one about a month ago