Uncovering the effects of model initialization on deep model generalization: A study with adult and pediatric Chest X-ray images

Model initialization techniques are vital for improving the performance and reliability of deep learning models in medical computer vision applications. While much literature exists on non-medical images, the impact of initialization on medical images, particularly chest X-rays (CXRs), is less well understood. To address this gap, our study explores three deep model initialization techniques, Cold-start, Warm-start, and Shrink-and-Perturb start, in adult and pediatric populations. We focus on settings in which training data arrive periodically, reflecting the real-world conditions of ongoing data influx and the need for model updates. We evaluate the generalizability of these models on external adult and pediatric CXR datasets. We also propose novel ensemble methods, F-score-weighted Sequential Least-Squares Quadratic Programming (F-SLSQP) and Attention-Guided Ensembles with Learnable Fuzzy Softmax, which aggregate weight parameters from multiple models to capitalize on their collective knowledge and complementary representations. We analyze model performance using statistical significance tests, reporting 95% confidence intervals and p-values. Our evaluations indicate that models initialized with ImageNet-pretrained weights generalize better than their randomly initialized counterparts, contradicting some findings for non-medical images. Notably, ImageNet-pretrained models perform consistently in internal and external testing across different training scenarios. Weight-level ensembles of these models show significantly higher recall (p<0.05) during testing than individual models. Our study thus underscores the benefits of ImageNet-pretrained weight initialization, especially when combined with weight-level ensembles, for creating robust and generalizable deep learning solutions.
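The three initialization strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the flat list standing in for model weights, and the values of `shrink` and `noise_scale` are all assumptions made for illustration, since the abstract gives no hyperparameters. The shrink-and-perturb step follows the commonly used form in which the previous weights are scaled toward zero and small Gaussian noise is added.

```python
import random


def cold_start(num_weights, rng, scale=0.01):
    """Cold-start: draw fresh random weights, ignoring any earlier training."""
    return [rng.gauss(0.0, scale) for _ in range(num_weights)]


def warm_start(previous):
    """Warm-start: continue from a copy of the weights learned on earlier data."""
    return list(previous)


def shrink_and_perturb(previous, rng, shrink=0.5, noise_scale=0.01):
    """Shrink-and-perturb: scale earlier weights toward zero, then add noise.

    `shrink` and `noise_scale` are illustrative values, not taken from the paper.
    """
    return [shrink * w + rng.gauss(0.0, noise_scale) for w in previous]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical stand-in for weights trained on the first batch of data.
    previous = [0.8, -1.2, 0.3]
    print("cold:  ", cold_start(len(previous), rng))
    print("warm:  ", warm_start(previous))
    print("shrink:", shrink_and_perturb(previous, rng))
```

When a new batch of data arrives, training would resume from whichever of these weight sets is chosen. With `shrink=1.0` and `noise_scale=0.0`, shrink-and-perturb reduces to a warm start.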

