Fourier-Net: Fast Image Registration with Band-limited Deformation

Unsupervised image registration commonly adopts U-Net style networks to predict dense displacement fields in the full-resolution spatial domain. For high-resolution volumetric image data, this process is however resource-intensive and time-consuming. To tackle this problem, we propose the Fourier-Net, replacing the expansive path in a U-Net style network with a parameter-free model-driven decoder. Specifically, instead of our Fourier-Net learning to output a full-resolution displacement field in the spatial domain, we learn its low-dimensional representation in a band-limited Fourier domain. This representation is then decoded by our devised model-driven decoder (consisting of a zero padding layer and an inverse discrete Fourier transform layer) to the dense, full-resolution displacement field in the spatial domain. These changes allow our unsupervised Fourier-Net to contain fewer parameters and computational operations, resulting in faster inference speeds. Fourier-Net is then evaluated on two public 3D brain datasets against various state-of-the-art approaches. For example, when compared to a recent transformer-based method, named TransMorph, our Fourier-Net, which only uses 2.2\% of its parameters and 6.66\% of the multiply-add operations, achieves a 0.5\% higher Dice score and an 11.48 times faster inference speed. Code is available at \url{https://github.com/xi-jia/Fourier-Net}.

翻译：无监督图像配准通常采用U-Net风格网络在满分辨率空间域中预测密集位移场。然而，对于高分辨率体积图像数据而言，此过程计算资源消耗大且耗时严重。为解决该问题，我们提出傅立叶网络（Fourier-Net），用无参数模型驱动解码器替代U-Net风格网络中的扩展路径。具体而言，我们的Fourier-Net并非学习在空间域输出满分辨率位移场，而是在带限傅立叶域中学习其低维表示。随后，该表示由我们设计的模型驱动解码器（包含零填充层和逆离散傅立叶变换层）解码为空间域中密集的满分辨率位移场。这些改进使得无监督Fourier-Net包含更少的参数和计算操作，从而获得更快的推理速度。我们在两个公开3D脑部数据集上对比多种先进方法评估了Fourier-Net。例如，与近期基于Transformer的方法TransMorph相比，仅使用其2.2%参数和6.66%乘法累加运算量的Fourier-Net，Dice得分提高了0.5%，推理速度提升了11.48倍。代码已开源在\url{https://github.com/xi-jia/Fourier-Net}。

相关内容

图像配准

关注 810

图像配准是图像处理研究领域中的一个典型问题和技术难点，其目的在于比较或融合针对同一对象在不同条件下获取的图像，例如图像会来自不同的采集设备，取自不同的时间，不同的拍摄视角等等，有时也需要用到针对不同对象的图像配准问题。具体地说，对于一组图像数据集中的两幅图像，通过寻找一种空间变换把一幅图像映射到另一幅图像，使得两图中对应于空间同一位置的点一一对应起来，从而达到信息融合的目的。该技术在计算机视觉、医学图像处理以及材料力学等领域都具有广泛的应用。根据具体应用的不同，有的侧重于通过变换结果融合两幅图像，有的侧重于研究变换本身以获得对象的一些力学属性。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日