8 results found
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Created 2024-03-23
190 commits to main branch, last one about a month ago
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Created 2023-06-26
122 commits to main branch, last one 2 days ago
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
Created 2024-06-02
141 commits to main branch, last one 4 days ago
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Created 2024-01-18
15 commits to main branch, last one 9 months ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
34 commits to main branch, last one 5 days ago
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Created 2023-10-29
9 commits to main branch, last one about a year ago
Language Repository for Long Video Understanding
Created 2024-03-19
10 commits to main branch, last one 7 months ago
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Created 2023-05-24
111 commits to main branch, last one about a year ago