Modern science increasingly relies on ever-growing observational datasets and automated inference pipelines, under the implicit belief that accumulating more data makes scientific conclusions more reliable. Here we show that this belief can fail in a fundamental and irreversible way. We identify a structural regime in which standard inference procedures run smoothly, remain well calibrated, and pass conventional diagnostic checks, yet systematically converge to incorrect conclusions. This failure arises when the reliability of observations degrades in a manner that is intrinsically unobservable to the inference process itself. Using minimal synthetic experiments, we demonstrate that in this regime additional data do not correct error but instead amplify it, while residual-based and goodness-of-fit diagnostics remain misleadingly normal. These results reveal an intrinsic limit of data-driven science: stability, convergence, and confidence are not sufficient indicators of epistemic validity. We argue that inference cannot be treated as an unconditional consequence of data availability, but must instead be governed by explicit constraints on the integrity of the observational process.
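The regime described above can be sketched with a toy experiment. The snippet below is an illustrative construction, not the paper's actual setup: all names (`fit`, `n_clean`, `bias`) and parameter values are assumptions chosen for demonstration. A sensor silently acquires a fixed offset after some point in the data stream; the inference procedure (a plain sample mean with its reported standard error) cannot see this, so the estimate drifts away from the truth as data accumulate, while the reported uncertainty shrinks.

```python
import numpy as np

rng = np.random.default_rng(0)
theta_true = 0.0   # ground-truth quantity being estimated
bias = 1.0         # offset acquired after the sensor silently degrades
n_clean = 1_000    # observations collected before the degradation

def fit(n):
    """Sample-mean inference on n observations; the first n_clean are
    clean, and every later one carries an offset that is unobservable
    to the inference procedure itself."""
    y = theta_true + rng.normal(0.0, 1.0, size=n)
    y[n_clean:] += bias                      # unobservable degradation
    est = y.mean()                           # point estimate
    se = y.std(ddof=1) / np.sqrt(n)          # reported standard error
    return est, se

for n in (1_000, 10_000, 100_000):
    est, se = fit(n)
    print(f"n={n:>7}: estimate={est:+.3f} ± {se:.4f}  (truth {theta_true})")
```

In this sketch the estimation error grows toward `bias` as `n` increases, yet the reported standard error keeps shrinking and the residuals remain approximately Gaussian, so conventional diagnostics raise no alarm, which is the qualitative behavior the abstract attributes to its minimal synthetic experiments.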