8 results found Sort:

65
585
bsd-3-clause
12
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Created 2024-03-23
190 commits to main branch, last one about a month ago
42
575
bsd-3-clause
12
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Created 2023-06-26
122 commits to main branch, last one 2 days ago
0
173
unknown
4
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
Created 2024-06-02
141 commits to main branch, last one 4 days ago
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Created 2024-01-18
15 commits to main branch, last one 9 months ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
34 commits to main branch, last one 5 days ago
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Created 2023-10-29
9 commits to main branch, last one about a year ago
Language Repository for Long Video Understanding
Created 2024-03-19
10 commits to main branch, last one 7 months ago
2
29
bsd-3-clause
1
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Created 2023-05-24
111 commits to main branch, last one about a year ago