Aug 8, 2019 · We propose a VQA benchmark, CRIC, which introduces new types of questions about Compositional Reasoning on vIsion and Commonsense, and an evaluation metric.
CRIC contains compositional questions to evaluate the ability of a model on alternatively inferring on vision and commonsense.
Abstract: Alternatively inferring on the visual facts and commonsense is fundamental for an advanced visual question answering (VQA) system.
A VQA benchmark, Compositional Reasoning on vIsion and Commonsense (CRIC), is proposed, which introduces new types of questions about CRIC, and an evaluation ...
To comprehensively evaluate such abilities, we propose a VQA benchmark, CRIC, which introduces new types of questions about Compositional Reasoning on vIsion ...
Oct 22, 2024 · Experimental results show that grounding the commonsense to the image region and joint reasoning on vision and commonsense are still challenging ...
In this paper, we introduce a VQA dataset that provides more challenging and general questions about Compositional Reasoning on vIsion and Commonsense, which is ...
This paper presents a new compositional model that is capable of implementing various types of reasoning functions on the image content and the knowledge ...
Description: A benchmark for visual question answering involving commonsense and compositional reasoning. Size: 96,000 images, 494,000 questions.