We introduce a new neural architecture for solving visual abstract reasoning tasks inspired by human cognition, specifically by observations that human abstract reasoning often interleaves perceptual and conceptual processing as part of a flexible, iterative, and dynamic cognitive process. Inspired by this principle, our architecture models visual abstract reasoning as an iterative, self-contrasting learning process that pursues consistency between perceptual and conceptual processing of visual stimuli. We explain how this new Contrastive Perceptual-Conceptual Network (CPCNet) works using matrix reasoning problems in the style of the well-known Raven's Progressive Matrices intelligence test. Experiments on the machine learning dataset RAVEN show that CPCNet achieves higher accuracy than all previously published models while also using the weakest inductive bias. We also point out a substantial and previously unremarked class imbalance in the original RAVEN dataset, and we propose a new variant of RAVEN -- AB-RAVEN -- that is more balanced in terms of abstract concepts.
翻译:我们提出了一种受人类认知启发的新神经架构,用于解决视觉抽象推理任务,其灵感尤其来源于人类抽象推理过程中常将感知处理与概念处理交织进行的观察,这一过程具有灵活、迭代和动态的特点。基于这一原理,我们的架构将视觉抽象推理建模为一种迭代的、自我对比的学习过程,旨在追求视觉刺激的感知与概念处理之间的一致性。我们以著名的瑞文推理测验风格矩阵推理问题为例,阐释这种新型对比感知-概念网络(CPCNet)的工作原理。在机器学习数据集RAVEN上的实验表明,CPCNet在采用最弱归纳偏置的同时,取得了高于所有先前已发表模型的准确率。我们还指出了原始RAVEN数据集中一个此前未被注意到的严重类别不平衡问题,并提出了一种新变体——AB-RAVEN——该变体在抽象概念方面更为均衡。