21 results found

Paper list of robotic grasping and some related works
Created 2022-03-29
82 commits to main branch, last one 23 days ago
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Created 2020-01-22
104 commits to master branch, last one about a year ago
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests w...
Created 2023-09-07
43 commits to main branch, last one 2 days ago
8
172
apache-2.0
3
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Created 2022-03-19
18 commits to main branch, last one about a year ago
10
144
apache-2.0
3
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Created 2022-03-14
43 commits to main branch, last one 4 months ago
14
131
unknown
1
SeqTR: A Simple yet Universal Network for Visual Grounding
Created 2022-03-30
27 commits to main branch, last one 24 days ago
4
109
other
3
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Created 2022-09-30
4 commits to master branch, last one about a year ago
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
Created 2022-04-15
21 commits to main branch, last one about a year ago
8
91
unknown
2
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
Created 2022-04-29
5 commits to master branch, last one about a year ago
Referring Video Object Segmentation / Multi-Object Tracking Repo
Created 2021-12-11
58 commits to main branch, last one about a year ago
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Created 2023-06-01
42 commits to main branch, last one about a year ago
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
Created 2024-10-17
5 commits to main branch, last one about a month ago
7
57
unknown
3
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
Created 2019-11-28
97 commits to master branch, last one 2 years ago
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Created 2022-04-19
10 commits to main branch, last one about a year ago
8
49
mit
4
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Created 2022-11-27
48 commits to main branch, last one 9 months ago
An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Created 2024-07-12
44 commits to main branch, last one about a month ago
9
47
unknown
3
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
Created 2021-03-28
12 commits to main branch, last one 3 years ago
1
44
unknown
2
[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Created 2023-11-25
25 commits to main branch, last one 3 months ago
6
41
unknown
2
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Created 2021-11-30
12 commits to main branch, last one 2 years ago
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
Created 2022-04-08
19 commits to main branch, last one 2 years ago