10 results found Sort:

67
605
bsd-3-clause
12
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Created 2024-03-23
190 commits to main branch, last one 3 months ago
42
604
bsd-3-clause
12
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Created 2023-06-26
122 commits to main branch, last one 2 months ago
58
514
other
20
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Created 2025-02-03
44 commits to main branch, last one 3 days ago
0
188
unknown
4
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
Created 2024-06-02
158 commits to main branch, last one 6 days ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
38 commits to main branch, last one about a month ago
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Created 2024-01-18
15 commits to main branch, last one 11 months ago
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Created 2023-10-29
9 commits to main branch, last one about a year ago
Language Repository for Long Video Understanding
Created 2024-03-19
10 commits to main branch, last one 9 months ago
2
30
bsd-3-clause
1
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Created 2023-05-24
111 commits to main branch, last one about a year ago
[ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Created 2025-01-05
18 commits to main branch, last one 11 days ago