Unmasking unlearnable models: a classification challenge for biomedical images without visible cues

Predicting traits from images that lack visual cues is challenging, because algorithms are designed to learn features that correlate visually with the ground truth. This problem is critical in the biomedical sciences, and its solution could improve the efficacy of non-invasive methods. For example, predicting MGMT methylation status from MRI images, the subject of a recent challenge, is critical for treatment decisions in glioma patients. Using models that are not robust poses a significant risk in such scenarios, which underscores the urgency of addressing this issue. Despite numerous efforts, contemporary models perform suboptimally, and the underlying reasons for this limitation remain elusive. In this study, we examine the complexity of MGMT status prediction by benchmarking existing models in combination with transfer learning. We further dissect their architectures by observing gradient flow across layers. Additionally, we apply a feature selection strategy to improve model interpretability. Our findings indicate that current models are unlearnable and that new architectures may be required before real-world applications can be explored. We believe our study will draw immediate attention and catalyse advancements in predictive modelling with non-visible cues.

