Accessing machine learning models through remote APIs has become increasingly prevalent following the recent trend of scaling up model parameters for better performance. Even though these models exhibit remarkable capabilities, detecting out-of-distribution (OOD) samples remains a crucial safety concern for end users, as such samples may induce unreliable outputs from the model. In this work, we propose an OOD detection framework, MixDiff, that is applicable even when the model's parameters or activations are inaccessible to the end user. To bypass this access restriction, MixDiff applies an identical input-level perturbation to a given target sample and to a similar in-distribution (ID) sample, then compares the relative difference in the model's outputs for the two samples. MixDiff is model-agnostic and compatible with existing output-based OOD detection methods. We provide a theoretical analysis illustrating MixDiff's effectiveness in discerning OOD samples that induce overconfident model outputs, and we empirically demonstrate that MixDiff consistently enhances OOD detection performance on various datasets in the vision and text domains.
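To make the output-only setting concrete, below is a minimal sketch of the perturb-and-compare idea described above. It assumes mixup-style interpolation as the input-level perturbation and maximum softmax probability (MSP) as the output-based score; the names `query_model`, `msp_score`, and `mixdiff_score`, as well as the toy linear model, are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def msp_score(probs):
    """Maximum softmax probability: a common output-based OOD score
    (higher = more in-distribution-like)."""
    return probs.max(axis=-1)

def mixdiff_score(query_model, target, id_anchor, aux, lam=0.5):
    """Perturb-and-compare sketch in the spirit of MixDiff.

    `query_model` stands in for any black-box prediction endpoint that
    returns class probabilities; only its outputs are used, never its
    parameters or activations.
    """
    # Identical input-level perturbation (here, mixup with the same
    # auxiliary sample) applied to the target and to a similar ID anchor.
    mixed_target = lam * target + (1.0 - lam) * aux
    mixed_anchor = lam * id_anchor + (1.0 - lam) * aux

    s_target = msp_score(query_model(mixed_target[None]))[0]
    s_anchor = msp_score(query_model(mixed_anchor[None]))[0]

    # Relative difference of the perturbed outputs: an overconfident OOD
    # target tends to lose more confidence under the perturbation than
    # the ID anchor does, yielding a larger difference.
    return s_anchor - s_target

# Toy usage with a stand-in linear "model" (illustrative only).
rng = np.random.default_rng(0)
W = rng.normal(size=(16, 4))
query_model = lambda x: softmax(x @ W)

target, id_anchor, aux = rng.normal(size=(3, 16))
print(mixdiff_score(query_model, target, id_anchor, aux, lam=0.5))
```

In practice this score would be combined with an existing output-based detector (e.g., added to the unperturbed target's MSP score), which is what makes the approach compatible with such methods rather than a replacement for them.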