12 results found
- Filter by Primary Language:
- Python (10)
- Jupyter Notebook (1)
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Created 2023-01-07
32 commits to main branch, last one 3 months ago
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Created 2023-01-07
31 commits to main branch, last one 6 months ago
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Created 2024-05-23
10 commits to main branch, last one 8 months ago
[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding
Topics: evals, gpt-4, reasoning, gemini-pro, navigation, perception, neurips-2024, summarization, visual-reasoning, benchmark-dataset, egocentric-videos, spatial-intelligence, multiple-choice-questions, long-context-understanding, video-language-understanding, multimodal-large-language-models, 1-hour-video-language-understanding, long-form-video-language-understanding
Created 2024-11-27
10 commits to main branch, last one 7 days ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created 2024-08-19
14 commits to master branch, last one 15 days ago
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
Created 2023-08-28
54 commits to main branch, last one 8 months ago
A survey on video and language understanding.
Created 2023-04-14
23 commits to main branch, last one about a year ago
Video Graph Transformer for Video Question Answering (ECCV'22)
Created 2022-07-20
26 commits to main branch, last one about a year ago
[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval
Created 2021-09-20
11 commits to master branch, last one 3 years ago
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
Created 2024-02-14
28 commits to main branch, last one 5 months ago
Repo for the paper "Paxion: Patching Action Knowledge in Video-Language Foundation Models" (NeurIPS'23 Spotlight)
Created 2023-05-08
16 commits to main branch, last one about a year ago
[2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Created 2022-11-16
7 commits to main branch, last one about a year ago