Out-of-distribution detection is one of the most critical issue in the deployment of machine learning. The data analyst must assure that data in operation should be compliant with the training phase as well as understand if the environment has changed in a way that autonomous decisions would not be safe anymore. The method of the paper is based on eXplainable Artificial Intelligence (XAI); it takes into account different metrics to identify any resemblance between in-distribution and out of, as seen by the XAI model. The approach is non-parametric and distributional assumption free. The validation over complex scenarios (predictive maintenance, vehicle platooning, covert channels in cybersecurity) corroborates both precision in detection and evaluation of training-operation conditions proximity.
翻译:分布外检测是机器学习部署中最关键的问题之一。数据分析师必须确保运行中的数据与训练阶段相一致,同时理解环境是否已发生变化,以至于自主决策不再安全。本文方法基于可解释人工智能(XAI);它考虑不同的度量标准,以识别由XAI模型观察到的分布内和分布外数据之间的相似性。该方法是非参数的且无分布假设。在复杂场景(预测性维护、车辆编队、网络安全中的隐蔽通道)上的验证,证实了检测精度以及对训练-运行条件接近度的评估。