3 results found Sort:
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
Created
2024-11-20
31 commits to main branch, last one 4 days ago
[ECCV 2024] ControlCap: Controllable Region-level Captioning
Created
2024-01-30
140 commits to main branch, last one 5 months ago
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
Created
2024-05-24
104 commits to main branch, last one 26 days ago