18 results found Sort:

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Created 2020-01-22
104 commits to master branch, last one about a year ago
paper list of robotic grasping and some related works
Created 2022-03-29
77 commits to main branch, last one about a month ago
8
163
apache-2.0
3
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Created 2022-03-19
18 commits to main branch, last one 9 months ago
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests w...
Created 2023-09-07
38 commits to main branch, last one 12 days ago
10
142
apache-2.0
3
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Created 2022-03-14
41 commits to main branch, last one about a year ago
14
123
unknown
1
SeqTR: A Simple yet Universal Network for Visual Grounding
Created 2022-03-30
18 commits to main branch, last one 7 months ago
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
Created 2022-04-15
21 commits to main branch, last one about a year ago
4
94
other
3
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Created 2022-09-30
4 commits to master branch, last one 8 months ago
5
86
unknown
2
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
Created 2022-04-29
5 commits to master branch, last one about a year ago
Referring Video Object Segmentation / Multi-Object Tracking Repo
Created 2021-12-11
58 commits to main branch, last one 11 months ago
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Created 2023-06-01
42 commits to main branch, last one 8 months ago
7
57
unknown
3
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
Created 2019-11-28
97 commits to master branch, last one 2 years ago
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Created 2022-04-19
10 commits to main branch, last one about a year ago
9
46
unknown
3
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
Created 2021-03-28
12 commits to main branch, last one 2 years ago
8
41
mit
4
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Created 2022-11-27
48 commits to main branch, last one 4 months ago
5
38
unknown
2
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Created 2021-11-30
12 commits to main branch, last one about a year ago
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
Created 2022-04-08
19 commits to main branch, last one about a year ago