Deployed machine learning systems must continuously evolve as data, architectures, and regulations change, often without access to the original training data or model internals. In such settings, black-box copying provides a practical refactoring mechanism: upgrading legacy models by learning replicas from input-output queries alone. When restricted to hard-label outputs, copying becomes a discontinuous surface-reconstruction problem from pointwise queries, which severely limits how efficiently boundary geometry can be recovered. We propose a distance-based copying (distillation) framework that replaces hard-label supervision with signed distances to the teacher's decision boundary, converting copying into a smooth regression problem that exploits local geometry. We develop an $\alpha$-governed smoothing and regularization scheme with Hölder/Lipschitz control over the induced target surface, and introduce two model-agnostic algorithms for estimating signed distances under label-only access. Experiments on synthetic problems and UCI benchmarks show consistent improvements in fidelity and generalization accuracy over hard-label baselines, while the distance outputs additionally serve as uncertainty-related signals for black-box replicas.
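The abstract does not specify the two signed-distance estimators; as a minimal illustration of what label-only distance estimation can look like, the sketch below brackets the boundary along random rays and refines each bracket by bisection, querying only hard labels. The `teacher` classifier, the sign convention, and all parameters (`n_directions`, `r_max`, `tol`) are hypothetical choices for this example, not the paper's algorithms.

```python
import math
import random

# Hypothetical hard-label teacher used only for illustration:
# a linear classifier whose boundary is the line x0 + x1 = 1.
def teacher(x):
    return int(x[0] + x[1] > 1.0)  # labels in {0, 1}

def signed_distance(x, teacher, n_directions=32, r_max=4.0, tol=1e-4, seed=0):
    """Label-only estimate of the signed distance from x to the teacher's
    decision boundary: probe random rays for a label flip, then bisect the
    bracket. Sign convention (assumed): positive for class-1 points,
    negative for class-0 points."""
    rng = random.Random(seed)
    y0 = teacher(x)
    best = r_max  # fallback if no flip is found within r_max
    for _ in range(n_directions):
        # Random unit direction (isotropic via normalized Gaussian draws).
        u = [rng.gauss(0.0, 1.0) for _ in x]
        norm = math.sqrt(sum(c * c for c in u))
        u = [c / norm for c in u]
        probe = lambda r: teacher([xi + r * ui for xi, ui in zip(x, u)])
        # Geometric expansion along the ray until the label flips.
        lo, hi, r = 0.0, None, tol
        while r <= r_max:
            if probe(r) != y0:
                hi = r
                break
            lo, r = r, 2.0 * r
        if hi is None:
            continue  # no flip along this ray within r_max
        # Bisection narrows the bracket [lo, hi] around the crossing.
        while hi - lo > tol:
            mid = 0.5 * (lo + hi)
            if probe(mid) != y0:
                hi = mid
            else:
                lo = mid
        best = min(best, hi)  # smallest crossing over all probed rays
    return best if y0 == 1 else -best
```

The bisection output along each ray upper-bounds the true perpendicular distance (only a ray normal to the boundary attains it), so taking the minimum over many random directions tightens the estimate at the cost of more label queries.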