A Path to Simpler Models Starts With Noise

The Rashomon set is the set of models that perform approximately equally well on a given dataset, and the Rashomon ratio is the fraction of all models in a given hypothesis space that are in the Rashomon set. Rashomon ratios are often large for tabular datasets in criminal justice, healthcare, lending, education, and other areas, which has practical implications for whether simpler models can attain the same level of accuracy as more complex models. An open question is why Rashomon ratios tend to be large. In this work, we propose and study a mechanism of the data generation process, coupled with choices usually made by the analyst during the learning process, that determines the size of the Rashomon ratio. Specifically, we demonstrate that noisier datasets lead to larger Rashomon ratios through the way that practitioners train models. Additionally, we introduce a measure called pattern diversity, which captures the average difference in predictions between distinct classification patterns in the Rashomon set, and motivate why it tends to increase with label noise. Our results explain a key aspect of why simpler models often perform as well as black box models on complex, noisier datasets.
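The two quantities defined above can be made concrete for a small, finite hypothesis space. The sketch below is illustrative only and is not the paper's implementation. It assumes 0-1 loss, reads "approximately equally well" as "within a tolerance `epsilon` of the best empirical loss", and takes pattern diversity to be the mean pairwise disagreement rate between distinct prediction patterns. The function names, the `epsilon` parameter, and the toy threshold classifiers are all hypothetical; the paper's formal definitions may differ in normalization and other details.

```python
import itertools

import numpy as np


def rashomon_members(predictions, y, epsilon):
    """Indices of hypotheses whose 0-1 loss is within epsilon of the best one.

    predictions: (n_hypotheses, n_samples) array of predicted labels.
    y:           (n_samples,) array of true labels.
    """
    losses = np.mean(predictions != y, axis=1)
    return np.flatnonzero(losses <= losses.min() + epsilon)


def rashomon_ratio(predictions, y, epsilon):
    """Fraction of the (finite) hypothesis space that lies in the Rashomon set."""
    members = rashomon_members(predictions, y, epsilon)
    return len(members) / predictions.shape[0]


def pattern_diversity(predictions, y, epsilon):
    """Mean pairwise disagreement between distinct prediction patterns.

    Assumption: "average difference in predictions" is taken here to be the
    average normalized Hamming distance over all pairs of unique patterns.
    """
    members = rashomon_members(predictions, y, epsilon)
    patterns = np.unique(predictions[members], axis=0)
    if len(patterns) < 2:
        return 0.0
    distances = [np.mean(a != b) for a, b in itertools.combinations(patterns, 2)]
    return float(np.mean(distances))


if __name__ == "__main__":
    # Toy hypothesis space: 51 one-dimensional threshold classifiers.
    rng = np.random.default_rng(0)
    x = rng.uniform(0.0, 1.0, size=200)
    y = (x > 0.5).astype(int)
    thresholds = np.linspace(0.0, 1.0, 51)
    stumps = np.array([(x > t).astype(int) for t in thresholds])

    print("Rashomon ratio:   ", rashomon_ratio(stumps, y, epsilon=0.05))
    print("Pattern diversity:", pattern_diversity(stumps, y, epsilon=0.05))
```

Enumerating hypotheses like this only works for tiny, finite spaces. For realistic model classes the ratio generally has to be estimated, for example by sampling, rather than computed exactly.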

