RobustiPy: An efficient next generation multiversal library with model selection, averaging, resampling, and explainable artificial intelligence

Scientific inference is often undermined by the vast but rarely explored "multiverse" of defensible modelling choices, which can generate results as variable as the phenomena under study. We introduce RobustiPy, an open-source Python library that systematizes multiverse analysis and model-uncertainty quantification at scale. RobustiPy unifies bootstrap-based inference, combinatorial specification search, model selection and averaging, joint-inference routines, and explainable AI methods within a modular, reproducible framework. Beyond exhaustive specification curves, it supports rigorous out-of-sample validation and quantifies the marginal contribution of each covariate. We demonstrate its utility across five simulation designs and ten empirical case studies spanning economics, sociology, psychology, and medicine, including a re-analysis of widely cited findings with documented discrepancies. Benchmarking on ~672 million simulated regressions shows that RobustiPy delivers state-of-the-art computational efficiency while expanding transparency in empirical research. By standardizing and accelerating robustness analysis, RobustiPy transforms how researchers interrogate sensitivity across the analytical multiverse, offering a practical foundation for more reproducible and interpretable computational science.

翻译：科学推理常因存在大量却鲜被系统探索的"多宇宙"（即多种可辩护的建模选择）而受到损害，这些选择可能产生与研究现象本身同样多样的结果。我们介绍了RobustiPy，一个开源的Python库，它能够大规模系统化地进行多宇宙分析与模型不确定性量化。RobustiPy在一个模块化、可复现的框架内，统一了基于自助法的推断、组合式规范搜索、模型选择与平均化、联合推断程序以及可解释人工智能方法。除了生成完备的规范曲线外，该库还支持严格的样本外验证，并能量化每个协变量的边际贡献。我们通过五个模拟实验设计和十个涵盖经济学、社会学、心理学和医学的实证案例研究（包括对存在已记录差异的广为人知发现的重新分析）展示了其实用性。在约6.72亿次模拟回归上的基准测试表明，RobustiPy在提升实证研究透明度的同时，实现了最先进的计算效率。通过标准化和加速稳健性分析，RobustiPy改变了研究人员在分析多宇宙中审视敏感性的方式，为提升计算科学的可复现性与可解释性提供了实用基础。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

OpenEarthAgent：一种面向工具增强型地理空间智能体的统一框架

专知会员服务

16+阅读 · 2月20日