Reusable embeddings of user behaviour have shown significant performance improvements for the personalised saliency prediction task. However, prior works require explicit user characteristics and preferences as input, which are often difficult to obtain. We present a novel method to extract user embeddings from pairs of natural images and corresponding saliency maps generated from a small amount of user-specific eye tracking data. At the core of our method is a Siamese convolutional neural encoder that learns the user embeddings by contrasting the image and personal saliency map pairs of different users. Evaluations on two public saliency datasets show that the generated embeddings have high discriminative power, are effective at refining universal saliency maps to the individual users, and generalise well across users and images. Finally, based on our model's ability to encode individual user characteristics, our work points towards other applications that can benefit from reusable embeddings of gaze behaviour.
翻译:从用户行为中提取的可复用嵌入已在个性化显著性预测任务中展现出显著性能提升。然而,现有方法需要将显式用户特征与偏好作为输入,这通常难以获取。本文提出一种新颖方法,通过少量用户特定眼动数据生成的图像-显著性图对来提取用户嵌入。该方法的核心理念是采用孪生卷积神经编码器,通过对比不同用户的图像与个性化显著性图对来学习用户嵌入。在两个公开显著性数据集上的评估表明,生成的嵌入具有高判别力,能有效将通用显著性图调整为个性化结果,并在用户与图像间展现出良好的泛化能力。此外,基于模型编码个体用户特征的能力,本研究提出了注视行为可复用嵌入在其他应用中的潜在价值。