Denoising diffusion-based MRI to CT image translation enables automated spinal segmentation

Robert Graf,Joachim Schmitt,Sarah Schlaeger,Hendrik Kristian Möller,Vasiliki Sideri-Lampretsa,Anjany Sekuboyina,Sandro Manuel Krieg,Benedikt Wiestler,Bjoern Menze,Daniel Rueckert,Jan Stefan Kirschke

from arxiv, 35 pages, 7 figures, Code and a model weights available https://doi.org/10.5281/zenodo.8221159 and https://doi.org/10.5281/zenodo.8198697

Background: Automated segmentation of spinal MR images plays a vital role both scientifically and clinically. However, accurately delineating posterior spine structures presents challenges. Methods: This retrospective study, approved by the ethical committee, involved translating T1w and T2w MR image series into CT images in a total of n=263 pairs of CT/MR series. Landmark-based registration was performed to align image pairs. We compared 2D paired (Pix2Pix, denoising diffusion implicit models (DDIM) image mode, DDIM noise mode) and unpaired (contrastive unpaired translation, SynDiff) image-to-image translation using "peak signal to noise ratio" (PSNR) as quality measure. A publicly available segmentation network segmented the synthesized CT datasets, and Dice scores were evaluated on in-house test sets and the "MRSpineSeg Challenge" volumes. The 2D findings were extended to 3D Pix2Pix and DDIM. Results: 2D paired methods and SynDiff exhibited similar translation performance and Dice scores on paired data. DDIM image mode achieved the highest image quality. SynDiff, Pix2Pix, and DDIM image mode demonstrated similar Dice scores (0.77). For craniocaudal axis rotations, at least two landmarks per vertebra were required for registration. The 3D translation outperformed the 2D approach, resulting in improved Dice scores (0.80) and anatomically accurate segmentations in a higher resolution than the original MR image. Conclusion: Two landmarks per vertebra registration enabled paired image-to-image translation from MR to CT and outperformed all unpaired approaches. The 3D techniques provided anatomically correct segmentations, avoiding underprediction of small structures like the spinous process.

翻译：背景：脊柱磁共振图像的自动分割在科学研究和临床实践中均具有重要作用。然而，精准勾画脊柱后部结构仍面临挑战。方法：本研究为回顾性研究，经伦理委员会批准，共纳入263对MRI/CT序列，将T1加权和T2加权MRI序列转换为CT图像。采用基于标志点的配准方法对齐图像对。以“峰值信噪比”（PSNR）为质量指标，比较了2D配对（Pix2Pix、去噪扩散隐式模型图像模式、去噪扩散隐式模型噪声模式）与非配对（对比非配对翻译、SynDiff）图像到图像转换方法。利用公开的分割网络对合成CT数据集进行分割，并通过内部测试集与“MRSpineSeg挑战赛”数据集评估Dice系数。2D研究结果被扩展至3D Pix2Pix和去噪扩散隐式模型。结果：在配对数据上，2D配对方法与SynDiff展现出相似的转换性能与Dice系数。去噪扩散隐式模型图像模式获得最高图像质量。SynDiff、Pix2Pix与去噪扩散隐式模型图像模式的Dice系数相近（0.77）。在头尾轴旋转中，每节椎骨至少需要两个标志点进行配准。3D转换优于2D方法，Dice系数提升至0.80，且分割结果在解剖结构精度和分辨率上均优于原始MRI图像。结论：每节椎骨两个标志点的配准方法实现了配对的MRI到CT图像转换，并优于所有非配对方法。3D技术可生成解剖结构正确的分割结果，避免棘突等小结构欠分割的问题。

相关内容

Automator

关注 0

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日