T-FAKE: Synthesizing Thermal Images for Facial Landmarking

Facial analysis is a key component in a wide range of applications such as security, autonomous driving, entertainment, and healthcare. Despite the availability of various facial RGB datasets, the thermal modality, which plays a crucial role in life sciences, medicine, and biometrics, has been largely overlooked. To address this gap, we introduce the T-FAKE dataset, a new large-scale synthetic thermal dataset with sparse and dense landmarks. To facilitate the creation of the dataset, we propose a novel RGB2Thermal loss function, which enables the transfer of thermal style to RGB faces. By utilizing the Wasserstein distance between thermal and RGB patches and the statistical analysis of clinical temperature distributions on faces, we ensure that the generated thermal images closely resemble real samples. Using RGB2Thermal style transfer based on our RGB2Thermal loss function, we create the T-FAKE dataset, a large-scale synthetic thermal dataset of faces. Leveraging our novel T-FAKE dataset, probabilistic landmark prediction, and label adaptation networks, we demonstrate significant improvements in landmark detection methods on thermal images across different landmark conventions. Our models show excellent performance with both sparse 70-point landmarks and dense 478-point landmark annotations. Our code and models are available at https://github.com/phflot/tfake.

翻译：面部分析是安全、自动驾驶、娱乐和医疗保健等广泛应用中的关键组成部分。尽管存在各种面部RGB数据集，但在生命科学、医学和生物识别中起关键作用的热成像模态在很大程度上被忽视了。为弥补这一空白，我们引入了T-FAKE数据集，这是一个包含稀疏和稠密关键点的大规模合成热成像数据集。为促进数据集的创建，我们提出了一种新颖的RGB2Thermal损失函数，该函数能够将热成像风格迁移到RGB面部图像上。通过利用热成像与RGB图像块之间的Wasserstein距离，并结合面部临床温度分布的统计分析，我们确保生成的热成像图像与真实样本高度相似。基于我们的RGB2Thermal损失函数进行RGB2Thermal风格迁移，我们创建了T-FAKE数据集——一个大规模的面部合成热成像数据集。借助我们新颖的T-FAKE数据集、概率关键点预测和标签自适应网络，我们证明了在不同关键点标注规范下，热成像图像的关键点检测方法均取得显著改进。我们的模型在稀疏70点关键点和稠密478点关键点标注上均表现出优异性能。我们的代码和模型可在https://github.com/phflot/tfake获取。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日