An Assessment of Human vs. Model Uncertainty in Soft-Label Learning and Calibration

Central to human-aligned AI is understanding the benefits of human-elicited labels over synthetic alternatives. While human soft-labels improve calibration by capturing uncertainty, prior studies conflate these benefits with the implicit correction of mislabeled data (mode shifts), obscuring true effects of soft-labels. We present a controlled audit of soft-label learning across MNIST and a synthetic variant, re-annotating subsets to extract human uncertainty. By decoupling soft-label supervision from underlying label mode shifts, we show that while human soft-labels do provide accuracy gains, their larger value lies in acting as a regularizer that improves model calibration on difficult samples and promotes stable convergence across training runs. Dataset cartography reveals models trained on human soft-labels mirror human uncertainty, whereas those trained on synthetic labels fail to align with humans. Broadly, this work provides a diagnostic testbed for human-AI uncertainty alignment.

翻译：实现与人类对齐的人工智能，关键在于理解人类标注相较于合成替代方案的益处。尽管人类软标签通过捕捉不确定性改善了模型校准，但先前的研究将这些益处与错误标注数据（模式偏移）的隐式修正混为一谈，从而模糊了软标签的真实效果。我们在MNIST数据集及其合成变体上对软标签学习进行了受控审计，通过对子集重新标注来提取人类不确定性。通过将软标签监督与底层标签模式偏移解耦，我们证明人类软标签虽能带来准确率的提升，但其更大价值在于充当正则化项：既能改善模型在困难样本上的校准效果，又能促进不同训练轮次间的稳定收敛。数据集图谱显示，基于人类软标签训练的模型能够镜像人类不确定性，而基于合成标签训练的模型则无法与人类对齐。总体而言，本研究为人类-人工智能不确定性对齐提供了诊断性测试平台。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《人机协作集成模型中的不确定性捕获》博士论文

专知会员服务

25+阅读 · 2025年10月2日

【博士论文】小型和大型模型的不确定性估计

专知会员服务

21+阅读 · 2025年7月11日

【ICML2025】通过概念对齐与混淆感知校准边界处理视觉-语言模型中的伪标签不平衡问题

专知会员服务

11+阅读 · 2025年5月6日

【博士论文】语言模型与人类偏好对齐，148页pdf

专知会员服务

32+阅读 · 2024年4月21日