A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction - 专知论文

会员服务 ·

0

优化器 · AutoML · MoDELS · 数据集 · Analysis ·

A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction

翻译：暂无翻译

Rui Huang,Lican Huang

Accurate disease risk prediction is challenged by heterogeneous features, limited data, and class imbalance. This study presents yvsoucom-iterkit, a deterministic AutoML framework that models pipeline optimization as a configuration-level system with full reproducibility and traceable execution logs, enabling systematic analysis of component attribution, interactions, similarity, and cross-seed robustness. Experiments on the Pima Indians Diabetes and Stroke datasets across more than 18,000 pipeline configurations reveal a structured yet partially redundant search space, where performance is dominated by a small subset of interacting components. Ensemble models achieve stable performance, reaching a Weighted-F1 of 0.89 on Pima and 0.94 on Stroke. Macro-F1 reaches approximately 0.88 on Pima but drops to 0.6560 on Stroke due to severe imbalance. Cross-seed experiments show that ensembles reduce variance compared to single models. Friedman testing ($p < 0.05$) confirms significant ranking differences across configurations. Based on analysis of component attribution, interaction, and similarity, optimal configuration design reveals dataset-dependent behavior. For the Pima dataset, computational efficiency benefits from simplified search spaces where redundant components can be removed, with split ratio playing a key role. In contrast, the Stroke dataset requires enhanced imbalance-aware strategies, where RandomOverSampler improves Macro-F1 from 0.6560 to 0.6766. These findings demonstrate that effective AutoML optimization is achieved through optimal configuration design, where carefully constraining the search space to high-impact components can improve performance, stability, and interpretability while reducing unnecessary search complexity.

翻译：暂无翻译

0

相关内容

优化器

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

专知会员服务

20+阅读 · 2月10日

EMNLP2023：Schema自适应的知识图谱构建

EMNLP2023：Schema自适应的知识图谱构建

专知会员服务

44+阅读 · 2023年12月3日

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

专知会员服务

29+阅读 · 2022年5月22日

【CVPR 2022】深度安全多视图聚类:降低因视图增加而导致聚类性能下降的风险，Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase

【CVPR 2022】深度安全多视图聚类:降低因视图增加而导致聚类性能下降的风险，Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase

专知会员服务

10+阅读 · 2022年3月12日

【论文推荐】一种用于逆合成预测的图到图框架，A Graph to Graphs Framework for Retrosynthesis Prediction

【论文推荐】一种用于逆合成预测的图到图框架，A Graph to Graphs Framework for Retrosynthesis Prediction

专知会员服务

12+阅读 · 2020年4月1日

医学图像分割的深度学习解决方案综述

医学图像分割的深度学习解决方案综述

专知会员服务

88+阅读 · 2020年2月14日

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

专知会员服务

13+阅读 · 2019年11月25日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

14+阅读 · 2019年11月15日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

「深度学习医学图像关键点检测」最新2022研究综述

「深度学习医学图像关键点检测」最新2022研究综述

专知

16+阅读 · 2022年4月10日

【CVPR2021】面向通用领域自适应的领域共识聚类

【CVPR2021】面向通用领域自适应的领域共识聚类

专知

24+阅读 · 2021年5月6日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

告别调参，AutoML新书发布

告别调参，AutoML新书发布

专知

14+阅读 · 2018年10月16日

PyTorch 中使用深度学习（CNN和LSTM）的自动图像捕获

PyTorch 中使用深度学习（CNN和LSTM）的自动图像捕获

AI研习社

40+阅读 · 2018年9月21日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

论文浅尝 | Improved Neural Relation Detection for KBQA

论文浅尝 | Improved Neural Relation Detection for KBQA

开放知识图谱

13+阅读 · 2018年1月21日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

多视角识别长非编码RNA和人类复杂疾病关联预测研究

国家自然科学基金

4+阅读 · 2017年12月31日

相互关联研发网络上风险级联传播建模及控制方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于网络的复杂疾病动态表观修饰模块挖掘

国家自然科学基金

0+阅读 · 2015年12月31日

基于生存树的急性心肌梗死早期预警及其多生理参数建模

国家自然科学基金

0+阅读 · 2015年12月31日

面向帕金森病的多模态在线预警方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向全生命周期的医疗保健资源供需匹配模式设计与优化研究

国家自然科学基金

1+阅读 · 2014年12月31日

血管稳态与重构的调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

自媒体环境下医患关系突发事件网络舆情演化与危机预警研究

国家自然科学基金

1+阅读 · 2014年12月31日

相继故障视角下基于风险传播模型的研发网络脆弱性研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于公立医院动态人本化管理的医患冲突预警和干预模式构建

国家自然科学基金

1+阅读 · 2014年12月31日

ChronoSurv: A Clinical Pathway-Guided Graph Framework for Multimodal Survival Analysis

Arxiv

0+阅读 · 6月17日

MedicalAgentsBench for Complex Medical Reasoning: Comparing Internalized Reasoning Models versus Externalized Agent-based Frameworks

Arxiv

0+阅读 · 6月16日

Expert-Driven Survival Machines: Improving Stratification and Interpretability in Multiple Clinical Cohorts

Arxiv

0+阅读 · 6月12日

GraphPINE: Graph Importance Propagation for Interpretable Drug Response Prediction

Arxiv

0+阅读 · 5月19日

An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis

Arxiv

0+阅读 · 5月19日

DRReduce: Enhancing Syntax-Guided Program Reduction with Dependency Reconstruction

Arxiv

0+阅读 · 5月19日

Shape-Adaptive Conditional Calibration for Conformal Prediction via Minimax Optimization

Arxiv

0+阅读 · 5月12日

ADAPTS: Agentic Decomposition for Automated Protocol-agnostic Tracking of Symptoms

Arxiv

0+阅读 · 5月6日

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future

Arxiv

36+阅读 · 2021年5月27日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

VIP会员

文章信息

相关主题

最新内容

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

专知会员服务

7+阅读 · 今天2:06

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

专知会员服务

5+阅读 · 今天1:37

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

专知会员服务

3+阅读 · 6月17日

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

专知会员服务

5+阅读 · 6月17日

学习数据的几何：形状空间分析数学综述

学习数据的几何：形状空间分析数学综述

专知会员服务

4+阅读 · 6月17日

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

专知会员服务

7+阅读 · 6月17日

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

7+阅读 · 6月17日

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

4+阅读 · 6月17日

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

6+阅读 · 6月17日

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

6+阅读 · 6月17日

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

专知会员服务

5+阅读 · 6月17日

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

专知会员服务

4+阅读 · 6月17日

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

专知会员服务

6+阅读 · 6月16日

多模态代码智能综述：从视觉输入到可执行代码系统

多模态代码智能综述：从视觉输入到可执行代码系统

专知会员服务

8+阅读 · 6月16日

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

专知会员服务

6+阅读 · 6月16日

相关VIP内容

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

专知会员服务

20+阅读 · 2月10日

EMNLP2023：Schema自适应的知识图谱构建

EMNLP2023：Schema自适应的知识图谱构建

专知会员服务

44+阅读 · 2023年12月3日

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

专知会员服务

29+阅读 · 2022年5月22日

【CVPR 2022】深度安全多视图聚类:降低因视图增加而导致聚类性能下降的风险，Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase

【CVPR 2022】深度安全多视图聚类:降低因视图增加而导致聚类性能下降的风险，Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase

专知会员服务

10+阅读 · 2022年3月12日

【论文推荐】一种用于逆合成预测的图到图框架，A Graph to Graphs Framework for Retrosynthesis Prediction

【论文推荐】一种用于逆合成预测的图到图框架，A Graph to Graphs Framework for Retrosynthesis Prediction

专知会员服务

12+阅读 · 2020年4月1日

医学图像分割的深度学习解决方案综述

医学图像分割的深度学习解决方案综述

专知会员服务

88+阅读 · 2020年2月14日

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

专知会员服务

13+阅读 · 2019年11月25日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

14+阅读 · 2019年11月15日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

相关资讯

「深度学习医学图像关键点检测」最新2022研究综述

「深度学习医学图像关键点检测」最新2022研究综述

专知

16+阅读 · 2022年4月10日

【CVPR2021】面向通用领域自适应的领域共识聚类

【CVPR2021】面向通用领域自适应的领域共识聚类

专知

24+阅读 · 2021年5月6日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

告别调参，AutoML新书发布

告别调参，AutoML新书发布

专知

14+阅读 · 2018年10月16日

PyTorch 中使用深度学习（CNN和LSTM）的自动图像捕获

PyTorch 中使用深度学习（CNN和LSTM）的自动图像捕获

AI研习社

40+阅读 · 2018年9月21日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

论文浅尝 | Improved Neural Relation Detection for KBQA

论文浅尝 | Improved Neural Relation Detection for KBQA

开放知识图谱

13+阅读 · 2018年1月21日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

相关论文

ChronoSurv: A Clinical Pathway-Guided Graph Framework for Multimodal Survival Analysis

Arxiv

0+阅读 · 6月17日

MedicalAgentsBench for Complex Medical Reasoning: Comparing Internalized Reasoning Models versus Externalized Agent-based Frameworks

Arxiv

0+阅读 · 6月16日

Expert-Driven Survival Machines: Improving Stratification and Interpretability in Multiple Clinical Cohorts

Arxiv

0+阅读 · 6月12日

GraphPINE: Graph Importance Propagation for Interpretable Drug Response Prediction

Arxiv

0+阅读 · 5月19日

An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis

Arxiv

0+阅读 · 5月19日

DRReduce: Enhancing Syntax-Guided Program Reduction with Dependency Reconstruction

Arxiv

0+阅读 · 5月19日

Shape-Adaptive Conditional Calibration for Conformal Prediction via Minimax Optimization

Arxiv

0+阅读 · 5月12日

ADAPTS: Agentic Decomposition for Automated Protocol-agnostic Tracking of Symptoms

Arxiv

0+阅读 · 5月6日

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future

Arxiv

36+阅读 · 2021年5月27日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

相关基金

多视角识别长非编码RNA和人类复杂疾病关联预测研究

国家自然科学基金

4+阅读 · 2017年12月31日

相互关联研发网络上风险级联传播建模及控制方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于网络的复杂疾病动态表观修饰模块挖掘

国家自然科学基金

0+阅读 · 2015年12月31日

基于生存树的急性心肌梗死早期预警及其多生理参数建模

国家自然科学基金

0+阅读 · 2015年12月31日

面向帕金森病的多模态在线预警方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向全生命周期的医疗保健资源供需匹配模式设计与优化研究

国家自然科学基金

1+阅读 · 2014年12月31日

血管稳态与重构的调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

自媒体环境下医患关系突发事件网络舆情演化与危机预警研究

国家自然科学基金

1+阅读 · 2014年12月31日

相继故障视角下基于风险传播模型的研发网络脆弱性研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于公立医院动态人本化管理的医患冲突预警和干预模式构建

国家自然科学基金

1+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员