11 results found
✨✨ Latest Advances on Multimodal Large Language Models
multi-modality
chain-of-thought
instruction-tuning
in-context-learning
instruction-following
large-language-models
visual-instruction-tuning
large-vision-language-model
multimodal-chain-of-thought
large-vision-language-models
multimodal-instruction-tuning
multimodal-in-context-learning
multimodal-large-language-models
Created
2023-05-19
825 commits to main branch, last one a day ago
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Created
2023-10-23
154 commits to main branch, last one 3 months ago
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Created
2023-09-26
416 commits to main branch, last one about a month ago
Mixture-of-Experts for Large Vision-Language Models
Created
2023-12-14
228 commits to main branch, last one 3 months ago
🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies
Created
2024-05-23
18 commits to main branch, last one 24 days ago
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
Created
2024-03-29
19 commits to main branch, last one 5 months ago
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Created
2025-02-15
49 commits to main branch, last one 21 hours ago
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
Created
2024-09-04
14 commits to master branch, last one 5 months ago
[NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Created
2024-06-06
16 commits to main branch, last one 3 months ago
🔎 Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
Created
2024-11-07
8 commits to main branch, last one 3 days ago
Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Models
deep-learning
computer-vision
machine-learning
foundation-models
vision-and-language
large-language-models
artificial-intelligence
large-vision-language-model
natural-language-processing
large-vision-language-models
artificial-general-intelligence
general-artificial-intelligence
multimodal-large-language-models
Created
2024-08-10
102 commits to main branch, last one 5 months ago