9 results found Sort:

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Created 2023-01-07
32 commits to main branch, last one 4 days ago
19
161
mit
12
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Created 2023-01-07
31 commits to main branch, last one 2 months ago
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Created 2024-05-23
10 commits to main branch, last one 5 months ago
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
Created 2023-08-28
54 commits to main branch, last one 5 months ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created 2024-08-19
13 commits to master branch, last one 2 months ago
A Survey on video and language understanding.
Created 2023-04-14
23 commits to main branch, last one about a year ago
12
46
apache-2.0
4
Video Graph Transformer for Video Question Answering (ECCV'22)
Created 2022-07-20
26 commits to main branch, last one about a year ago
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
Created 2023-05-08
16 commits to main branch, last one about a year ago
1
31
other
1
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
Created 2024-02-14
28 commits to main branch, last one about a month ago