An Ambiguity Measure for Recognizing the Unknowns in Deep Learning

We study the understanding of deep neural networks from the scope in which they are trained on. While the accuracy of these models is usually impressive on the aggregate level, they still make mistakes, sometimes on cases that appear to be trivial. Moreover, these models are not reliable in realizing what they do not know leading to failures such as adversarial vulnerability and out-of-distribution failures. Here, we propose a measure for quantifying the ambiguity of inputs for any given model with regard to the scope of its training. We define the ambiguity based on the geometric arrangements of the decision boundaries and the convex hull of training set in the feature space learned by the trained model, and demonstrate that a single ambiguity measure may detect a considerable portion of mistakes of a model on in-distribution samples, adversarial inputs, as well as out-of-distribution inputs. Using our ambiguity measure, a model may abstain from classification when it encounters ambiguous inputs leading to a better model accuracy not just on a given testing set, but on the inputs it may encounter at the world at large. In pursuit of this measure, we develop a theoretical framework that can identify the unknowns of the model in relation to its scope. We put this in perspective with the confidence of the model and develop formulations to identify the regions of the domain which are unknown to the model, yet the model is guaranteed to have high confidence.

翻译：我们研究深度神经网络在其训练范围的理解。尽管这些模型在总体水平上的准确率通常令人印象深刻，但它们仍然会犯错，有时甚至是在看似简单的问题上。此外，这些模型无法可靠地意识到自身未知的领域，从而导致了诸如对抗性脆弱性和分布外失效等问题。本文提出了一种度量方法，用于量化任意给定模型在其训练范围内输入数据的模糊度。我们基于训练好的模型在特征空间中所学习到的决策边界几何排列和训练集的凸包来定义模糊度，并证明单一的模糊度度量可以检测出模型在分布内样本、对抗性输入以及分布外输入中的相当一部分错误。利用我们的模糊度度量，当模型遇到模糊输入时可以拒绝分类，从而不仅能提高模型在给定测试集上的准确率，还能提升其在现实世界中可能遇到的各种输入上的表现。为实现这一度量，我们构建了一个理论框架，用于识别模型在其范围内未知的领域。我们将这一框架与模型的置信度相结合，提出了相应的公式来识别那些模型未知但保证具有高置信度的域空间。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日