Physics-Driven Autoregressive State Space Models for Medical Image Reconstruction

Medical image reconstruction from undersampled acquisitions is an ill-posed problem that involves inversion of the imaging operator linking measurement and image domains. In recent years, physics-driven (PD) models have gained prominence in learning-based reconstruction given their enhanced balance between efficiency and performance. For reconstruction, PD models cascade data-consistency modules that enforce fidelity to acquired data based on the imaging operator, with network modules that process feature maps to alleviate image artifacts due to undersampling. Success in artifact suppression inevitably depends on the ability of the network modules to tease apart artifacts from underlying tissue structures, both of which can manifest contextual relations over broad spatial scales. Convolutional modules that excel at capturing local correlations are relatively insensitive to non-local context. While transformers promise elevated sensitivity to non-local context, practical implementations often suffer from a suboptimal trade-off between local and non-local sensitivity due to intrinsic model complexity. Here, we introduce a novel physics-driven autoregressive state space model (MambaRoll) for enhanced fidelity in medical image reconstruction. In each cascade of an unrolled architecture, MambaRoll employs an autoregressive framework based on physics-driven state space modules (PSSM), where PSSMs efficiently aggregate contextual features at a given spatial scale while maintaining fidelity to acquired data, and autoregressive prediction of next-scale feature maps from earlier spatial scales enhance capture of multi-scale contextual features. Demonstrations on accelerated MRI and sparse-view CT reconstructions indicate that MambaRoll outperforms state-of-the-art PD methods based on convolutional, transformer and conventional SSM modules.

翻译：医学图像重建从欠采样采集是一个不适定问题，涉及成像算子连接测量域与图像域的逆问题求解。近年来，物理驱动（PD）模型在基于学习的重建中日益突出，因其在效率与性能之间实现了更优的平衡。对于重建任务，PD模型将基于成像算子、强制与采集数据一致性的数据一致性模块，与处理特征图以减轻欠采样所致图像伪影的网络模块级联。伪影抑制的成功必然依赖于网络模块区分伪影与底层组织结构的能力，二者均可能在广阔的空间尺度上表现出上下文关联。擅长捕捉局部相关性的卷积模块对非局部上下文相对不敏感。尽管Transformer有望提升对非局部上下文的敏感性，但实际实现常因内在模型复杂性而在局部与非局部敏感性之间面临次优权衡。本文提出一种新颖的物理驱动自回归状态空间模型（MambaRoll），以提升医学图像重建的保真度。在展开架构的每一级联中，MambaRoll采用基于物理驱动状态空间模块（PSSM）的自回归框架，其中PSSM在给定空间尺度上高效聚合上下文特征，同时保持与采集数据的一致性，而基于先前空间尺度对下一尺度特征图的自回归预测则增强了对多尺度上下文特征的捕捉。在加速MRI与稀疏视图CT重建上的实验表明，MambaRoll在性能上超越了基于卷积、Transformer及传统SSM模块的先进PD方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日