12 results found Sort:

233
3.2k
apache-2.0
30
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Created 2023-10-23
154 commits to main branch, last one 4 months ago
172
2.8k
apache-2.0
44
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Created 2023-09-26
416 commits to main branch, last one 3 months ago
134
2.2k
apache-2.0
23
Mixture-of-Experts for Large Vision-Language Models
Created 2023-12-14
228 commits to main branch, last one 4 months ago
2
194
unknown
5
🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies
Created 2024-05-23
19 commits to main branch, last one 9 days ago
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"
Created 2024-03-29
19 commits to main branch, last one 6 months ago
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
Created 2024-09-04
14 commits to master branch, last one 6 months ago
[NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Created 2024-06-06
16 commits to main branch, last one 4 months ago
✨✨latest advancements in VLA models(VIsion Language Action)
Created 2025-04-13
24 commits to main branch, last one 9 days ago
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
Created 2024-11-07
8 commits to main branch, last one about a month ago