The Earth mover's distance (EMD) is a useful metric for image recognition and classification, but its usual implementations are not differentiable or too slow to be used as a loss function for training other algorithms via gradient descent. In this paper, we train a convolutional neural network (CNN) to learn a differentiable, fast approximation of the EMD and demonstrate that it can be used as a substitute for computing-intensive EMD implementations. We apply this differentiable approximation in the training of an autoencoder-inspired neural network (encoder NN) for data compression at the high-luminosity LHC at CERN. The goal of this encoder NN is to compress the data while preserving the information related to the distribution of energy deposits in particle detectors. We demonstrate that the performance of our encoder NN trained using the differentiable EMD CNN surpasses that of training with loss functions based on mean squared error.
翻译:推土机距离(EMD)是图像识别与分类中的一种有效度量,但其常规实现要么不可微,要么因速度过慢而无法作为损失函数用于通过梯度下降训练其他算法。本文训练了一个卷积神经网络(CNN)以学习EMD的可微、快速近似,并证明其可作为计算密集型EMD实现的替代方案。我们将在CERN高亮度LHC的数据压缩中,将此可微近似应用于一种受自编码器启发的神经网络(编码器NN)的训练。该编码器NN的目标是在压缩数据的同时,保留与粒子探测器中能量沉积分布相关的信息。我们证明,使用可微EMD CNN训练的编码器NN的性能,优于基于均方误差损失函数训练的性能。