×
Jan 3, 2021 · This work aims to tackle the joint understanding of vision and language under a Commands for Autonomous Vehicles (C4AV) setting.
In this work, we focus on the object referral problem in the autonomous driving setting. We use a stacked visual-linguistic BERT model to learn a generic visual ...
Nov 21, 2024 · In this work, we focus on the object referral problem in the autonomous driving setting. We use a stacked visual-linguistic BERT model to ...
In this work, we focus on the object referral problem in the autonomous driving setting. We use a stacked visual-linguistic BERT model to learn a generic ...
This work deviate from recent, popular task settings and considers a situation where passengers can give free-form natural language commands to a vehicle ...
People also ask
Commands for Autonomous Vehicles by Progressively Stacking Visual-Linguistic Representations. ; dc:identifier, DBLP conf/eccv/DaiLDS20 (xsd:string) ; dc: ...
Sep 8, 2024 · The task of visual grounding requires locating the most relevant region or object in an image, given a natural language query.
Jan 2, 2021 · Part II focusses on commands for autonomous vehicles; computer vision for ART analysis; sign language recognition, translation and production; ...
... Commands for autonomous vehicles by progressively stacking visual-linguistic representations. In: Proceedings of the 16th European Conference on Computer ...
... Commands for autonomous vehicles by progressively stacking visual-linguistic representations. In: Proceedings of the 16th European Conference on Computer ...