Kinship recognition aims to determine whether the subjects in two facial images are kin or non-kin, which is an emerging and challenging problem. However, most previous methods focus on heuristic designs without considering the spatial correlation between face images. In this paper, we aim to learn discriminative kinship representations embedded with the relation information between face components (e.g., eyes, nose, etc.). To achieve this goal, we propose the Face Componential Relation Network, which learns the relationship between face components among images with a cross-attention mechanism, which automatically learns the important facial regions for kinship recognition. Moreover, we propose \Learning, which adapts the loss function by the guidance from cross-attention to learn more discriminative feature representations. The proposed \MainMethodAbbr~outperforms previous state-of-the-art methods by large margins for the largest public kinship recognition FIW benchmark. The code will be publicly released upon acceptance.
翻译:亲属关系识别旨在判断两张面部图像中的主体是否具有亲属关系,这是一个新兴且具有挑战性的问题。然而,以往大多数方法侧重于启发式设计,未考虑面部图像之间的空间相关性。本文旨在学习嵌入面部组件(如眼睛、鼻子等)关系信息的判别性亲属关系表征。为实现此目标,我们提出了面部组件关系网络(Face Componential Relation Network),该网络通过跨注意力机制学习图像间面部组件之间的关系,从而自动识别对亲属关系识别重要的面部区域。此外,我们提出了\学习算法,通过跨注意力机制的指导自适应调整损失函数,以学习更具判别性的特征表征。所提出的\MainMethodAbbr方法在最大的公开亲属关系识别基准FIW上,以显著优势超越了以往的最优方法。代码将在论文被接收后公开。