3 results found Sort:
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Created
2023-10-22
132 commits to main branch, last one 3 months ago
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
Created
2024-04-02
48 commits to main branch, last one 20 days ago
Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Created
2024-05-28
14 commits to main branch, last one 16 days ago