9 results found Sort:

42
589
bsd-3-clause
12
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Created 2023-06-26
122 commits to main branch, last one 24 days ago
65
588
bsd-3-clause
12
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Created 2024-03-23
190 commits to main branch, last one 2 months ago
36
342
other
15
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Created 2025-02-03
12 commits to main branch, last one 5 days ago
0
175
unknown
4
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
Created 2024-06-02
149 commits to main branch, last one 4 days ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
38 commits to main branch, last one 4 hours ago
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Created 2024-01-18
15 commits to main branch, last one 10 months ago
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Created 2023-10-29
9 commits to main branch, last one about a year ago
Language Repository for Long Video Understanding
Created 2024-03-19
10 commits to main branch, last one 8 months ago
2
29
bsd-3-clause
1
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Created 2023-05-24
111 commits to main branch, last one about a year ago