In this paper, we propose a multi-task representation learning framework to jointly estimate the identity, gender and age of individuals from their hand images for the purpose of criminal investigations since the hand images are often the only available information in cases of serious crime such as sexual abuse. We investigate different up-to-date deep learning architectures and compare their performance for joint estimation of identity, gender and age from hand images of perpetrators of serious crime. To simplify the age prediction, we create age groups for the age estimation. We make extensive evaluations and comparisons of both convolution-based and transformer-based deep learning architectures on a publicly available 11k hands dataset. Our experimental analysis shows that it is possible to efficiently estimate not only identity but also other attributes such as gender and age of suspects jointly from hand images for criminal investigations, which is crucial in assisting international police forces in the court to identify and convict abusers.
翻译:本文提出了一种多任务表示学习框架,旨在通过手部图像联合估计个体的身份、性别和年龄,以支撑刑事调查工作——因为在性虐待等严重犯罪案件中,手部图像往往是唯一可用的证据。我们研究了多种前沿深度学习架构,并比较了它们在严重犯罪实施者手部图像中联合估计身份、性别和年龄的性能。为简化年龄预测,我们将年龄划分为不同的组别。基于公开的11k手部数据集,我们对基于卷积和基于Transformer的深度学习架构进行了广泛的评估与比较。实验分析表明,从手部图像中不仅能有效估计嫌疑人的身份,还可同时估计其性别和年龄等属性,这对于协助国际警方在法庭上识别并定罪施虐者具有关键意义。