21 results found Sort:
- Filter by Primary Language:
- Python (16)
- Jupyter Notebook (1)
- +
awesome grounding: A curated list of research papers in visual grounding
Created
2018-09-03
97 commits to master branch, last one about a year ago
paper list of robotic grasping and some related works
Created
2022-03-29
82 commits to main branch, last one 23 days ago
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Created
2020-01-22
104 commits to master branch, last one about a year ago
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests w...
Created
2023-09-07
43 commits to main branch, last one 2 days ago
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Created
2022-03-19
18 commits to main branch, last one about a year ago
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Created
2022-03-14
43 commits to main branch, last one 4 months ago
SeqTR: A Simple yet Universal Network for Visual Grounding
Created
2022-03-30
27 commits to main branch, last one 24 days ago
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Created
2022-09-30
4 commits to master branch, last one about a year ago
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
Created
2022-04-15
21 commits to main branch, last one about a year ago
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
Created
2022-04-29
5 commits to master branch, last one about a year ago
Referring Video Object Segmentation / Multi-Object Tracking Repo
Created
2021-12-11
58 commits to main branch, last one about a year ago
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Created
2023-06-01
42 commits to main branch, last one about a year ago
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
Created
2024-10-17
5 commits to main branch, last one about a month ago
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
Created
2019-11-28
97 commits to master branch, last one 2 years ago
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Created
2022-04-19
10 commits to main branch, last one about a year ago
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Created
2022-11-27
48 commits to main branch, last one 9 months ago
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Created
2024-07-12
44 commits to main branch, last one about a month ago
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
Created
2021-03-28
12 commits to main branch, last one 3 years ago
[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Created
2023-11-25
25 commits to main branch, last one 3 months ago
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Created
2021-11-30
12 commits to main branch, last one 2 years ago
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
Created
2022-04-08
19 commits to main branch, last one 2 years ago