13 results found Sort:
awesome grounding: A curated list of research papers in visual grounding
Created
2018-09-03
97 commits to master branch, last one about a year ago
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
Created
2023-05-27
134 commits to main branch, last one 7 months ago
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Created
2023-11-20
8 commits to main branch, last one 11 months ago
Awesome papers & datasets specifically focused on long-term videos.
Created
2022-07-11
47 commits to main branch, last one about a month ago
[MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Created
2024-07-16
36 commits to main branch, last one 4 months ago
[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Created
2022-03-18
104 commits to master branch, last one 18 days ago
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
Created
2023-11-10
18 commits to main branch, last one 4 months ago
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022
Created
2022-04-12
5 commits to master branch, last one 2 years ago
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
Created
2023-08-28
54 commits to main branch, last one 5 months ago
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Created
2019-12-15
22 commits to master branch, last one 4 years ago
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
Created
2024-01-02
119 commits to main branch, last one 6 months ago
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos"
Created
2022-04-18
8 commits to main branch, last one 2 years ago
paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV))
Created
2023-01-11
5 commits to main branch, last one about a year ago