5 results found Sort:
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Created
2023-05-06
145 commits to main branch, last one 8 months ago
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
Created
2023-12-16
49 commits to master branch, last one 3 days ago
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Created
2024-01-18
15 commits to main branch, last one 9 months ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created
2024-08-19
13 commits to master branch, last one 4 months ago
A Survey on video and language understanding.
Created
2023-04-14
23 commits to main branch, last one about a year ago