用于视觉推理的诊断数据集
摘要
介绍
visual question answering VQA
The two main differences betweenCLEVR and other VQA datasets are that: (1) CLEVR controls biases found in prior VQA datasets that can be usedby learning systems to answer questions correctly without visual reasoning and (2) CLEVR’s synthetic nature and de-tailed annotations facilitate in-depth analyses of reasoning abilities that are impossible with existing datasets.
Related work
The Clevr Diagnostic Dataset
- question family, question generation,