We participated in the SynthRAD2025 challenge (Tasks 1 and 2) with a unified pipeline for synthetic CT (sCT) generation from MRI and CBCT, implemented in the KonfAI framework. Our model is a 2.5D U-Net++ with a ResNet-34 encoder, trained jointly across anatomical regions and then fine-tuned per region. The loss function combined pixel-wise L1 loss with IMPACT-Synth, a perceptual loss derived from SAM and TotalSegmentator features, to enhance structural fidelity. Training used AdamW (initial learning rate 0.001, halved every 25k steps) on patch-based, normalized, body-masked inputs (320×320 for MRI, 256×256 for CBCT), with random flipping as the only augmentation. No post-processing was applied. Final predictions combined test-time augmentation with five-fold ensembling, and the best model was selected by validation MAE. Two registration strategies were evaluated: (i) Elastix with mutual information, consistent with the challenge pipeline, and (ii) IMPACT, a feature-based similarity metric built on pretrained segmentation networks. On the local test sets, IMPACT-based registration produced more accurate and anatomically consistent alignments than mutual-information-based registration, yielding sCT synthesis with lower MAE and more realistic anatomical structures. On the public validation set, however, models trained on Elastix-aligned data achieved higher scores, reflecting a registration bias that favors alignment strategies consistent with the evaluation pipeline. This highlights how registration errors can propagate into supervised learning, influencing both training and evaluation and potentially inflating performance metrics at the expense of anatomical fidelity. By promoting anatomically consistent alignment, IMPACT mitigates this bias and supports the development of more robust and generalizable sCT synthesis models.
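The step-based learning-rate decay described above (AdamW starting at 0.001, halved every 25k steps) can be sketched as a small helper. This is an illustrative reconstruction, not code from the actual pipeline; the function name `lr_at_step` is hypothetical.

```python
def lr_at_step(step, base_lr=1e-3, halve_every=25_000):
    """Learning rate in effect at a given training step: the base rate
    is halved once for every `halve_every` steps completed (a standard
    step-decay schedule, e.g. StepLR in PyTorch with gamma=0.5)."""
    return base_lr * 0.5 ** (step // halve_every)

print(lr_at_step(0))       # 0.001
print(lr_at_step(25_000))  # 0.0005
print(lr_at_step(60_000))  # 0.00025
```

The schedule is piecewise constant: the rate stays at 0.001 for the first 25k steps, then drops to 0.0005, 0.00025, and so on.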
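The inference strategy above (test-time flip augmentation averaged with five-fold ensembling) can be sketched as follows. This is a minimal illustration assuming NumPy arrays and flip-only TTA; `predict_with_tta` and the dummy models are placeholders, not the actual trained networks.

```python
import numpy as np

def predict_with_tta(models, image):
    """Average predictions over an ensemble of models and a horizontal-flip
    test-time augmentation: each model predicts on the original image and on
    a flipped copy (whose prediction is flipped back before averaging)."""
    preds = []
    for model in models:
        preds.append(model(image))  # original view
        # Flipped view: predict on the mirrored image, then mirror back.
        preds.append(np.flip(model(np.flip(image, axis=-1)), axis=-1))
    return np.mean(preds, axis=0)

# Toy usage with two stand-in "fold" models (real use would pass five):
img = np.arange(4.0).reshape(2, 2)
folds = [lambda x: x + 1.0, lambda x: x * 2.0]
out = predict_with_tta(folds, img)  # mean over 2 models x 2 views
```

Averaging over flips and folds reduces prediction variance without any post-processing, which matches the pipeline's choice to apply none.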