Acquisition state behaves as a structured, measurable variable governing lung-nodule AI: kernel-driven measurement instability and noise-driven detection fragility, invisible to DICOM metadata

翻译：采集状态作为调控肺结节AI的结构化、可测量变量：核驱动测量不稳定性和噪声驱动检测脆弱性——DICOM元数据不可见

Daniel Soliman

AI governance for medical imaging is formalizing: the 2026 ACR-SIIM Practice Parameter recommends local acceptance testing and ongoing drift monitoring, and the ACR Assess-AI registry monitors AI outputs using DICOM metadata for context. We argue that a necessary, currently unmonitored layer sits beneath output metrics: whether incoming studies remain within the acquisition envelope a model was validated on. Using a LUNA16-trained MONAI RetinaNet lung-nodule detector, we test whether acquisition state behaves as a structured, measurable variable. On real paired CT differing only in reconstruction kernel (NLST B30f vs B80f), kernel alone shifted AI-measured diameter and flipped a Fleischner size category in 5.2% (8 of 155) of nodules at fixed patient and acquisition, while detection confidence was unchanged (Wilcoxon p=0.22). Under controlled LIDC-IDRI perturbations the effects dissociated by axis: the noise axis degraded detection confidence (p=5.9e-32, concentrated in nodules under 6 mm) but not measurement, while the frequency/kernel axis corrupted measurement (p=8.6e-13) but not detection. A 4-feature pixel fingerprint recovered reconstruction identity (patient-level AUC about 0.95 on real CT, 0.995 on a QIBA phantom) where the ConvolutionKernel DICOM tag was uninformative (identical labels across reconstructions). The kernel axis transported across four manufacturers (leave-one-vendor-out AUC 0.94-0.98, matching the within-vendor ceiling). Acquisition state thus maps to distinct AI failure modes, frequency content to measurement reliability and noise to detection sensitivity, and is not recoverable from metadata. Acquisition-aware, input-side validation is the missing layer for the acceptance-testing and drift-monitoring requirements now entering imaging-AI accreditation.

翻译：医学影像人工智能治理正趋于规范化：2026年ACR-SIIM实践参数建议开展本地验收测试和持续漂移监测，ACR Assess-AI注册中心通过DICOM元数据监测AI输出结果。我们主张，在输出指标之下存在一个必要但尚未被监测的层面：传入研究是否仍处于模型验证时的采集包络范围内。使用基于LUNA16训练的MONAI RetinaNet肺结节检测器，我们验证采集状态是否表现为结构化的、可测量的变量。在仅重建核不同的真实配对CT（NLST B30f vs B80f）中，固定患者及采集条件下，仅核变化便使AI测量直径发生偏移，并在5.2%（155个结节中的8个）的结节中翻转Fleischner尺寸分类，而检测置信度无显著变化（Wilcoxon检验p=0.22）。在受控LIDC-IDRI扰动下，上述效应沿不同轴解离：噪声轴降低检测置信度（p=5.9e-32，集中于6毫米以下结节）而不影响测量，而频率/核轴破坏测量（p=8.6e-13）但不影响检测。基于4个特征的像素指纹可重建重建身份（真实CT上患者水平AUC约0.95，QIBA体模上为0.995），而DICOM标签ConvolutionKernel在此场景下无信息价值（不同重建的标签完全相同）。核轴效应可跨四家制造商传递（留一供应商验证AUC 0.94-0.98，与供应商内上限持平）。因此，采集状态对应不同的AI失效模式——频率内容影响测量可靠性，噪声影响检测灵敏度——且无法通过元数据恢复。面向采集的输入侧验证，正是当前影像AI认证体系中验收测试和漂移监测要求所缺失的关键环节。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

专知会员服务

20+阅读 · 2月10日

在ISR中利用人工智能：跨多种数据的威胁识别，用于综合辐射源分析

专知会员服务

47+阅读 · 2024年1月12日

重磅!《“可信AI”评估体系产品手册》正式发布,24页pdf

专知会员服务

77+阅读 · 2023年7月4日

【剑桥大学博士论文】基于弱监督的结构化数据学习，210页pdf

专知会员服务

28+阅读 · 2023年6月19日