RetinaRegNet: A Versatile Approach for Retinal Image Registration

Vishal Balaji Sivaraman,Muhammad Imran,Qingyue Wei,Preethika Muralidharan,Michelle R. Tamplin,Isabella M . Grumbach,Randy H. Kardon,Jui-Kai Wang,Yuyin Zhou,Wei Shao

We introduce the RetinaRegNet model, which can achieve state-of-the-art performance across various retinal image registration tasks. RetinaRegNet does not require training on any retinal images. It begins by establishing point correspondences between two retinal images using image features derived from diffusion models. This process involves the selection of feature points from the moving image using the SIFT algorithm alongside random point sampling. For each selected feature point, a 2D correlation map is computed by assessing the similarity between the feature vector at that point and the feature vectors of all pixels in the fixed image. The pixel with the highest similarity score in the correlation map corresponds to the feature point in the moving image. To remove outliers in the estimated point correspondences, we first applied an inverse consistency constraint, followed by a transformation-based outlier detector. This method proved to outperform the widely used random sample consensus (RANSAC) outlier detector by a significant margin. To handle large deformations, we utilized a two-stage image registration framework. A homography transformation was used in the first stage and a more accurate third-order polynomial transformation was used in the second stage. The model's effectiveness was demonstrated across three retinal image datasets: color fundus images, fluorescein angiography images, and laser speckle flowgraphy images. RetinaRegNet outperformed current state-of-the-art methods in all three datasets. It was especially effective for registering image pairs with large displacement and scaling deformations. This innovation holds promise for various applications in retinal image analysis. Our code is publicly available at https://github.com/mirthAI/RetinaRegNet .

翻译：我们提出了RetinaRegNet模型，该模型能够在各种视网膜图像配准任务中实现最先进的性能，且无需在任何视网膜图像上进行训练。该方法首先利用扩散模型提取的图像特征建立两幅视网膜图像之间的点对应关系。该过程包括使用SIFT算法结合随机点采样从移动图像中选取特征点，通过计算每个选定特征点的特征向量与固定图像中所有像素特征向量之间的相似性来生成二维相关性图，其中相关性图中相似性得分最高的像素即对应移动图像中的特征点。为去除估计点对应关系中的异常值，我们首先应用逆一致性约束，随后采用基于变换的异常值检测器。实验证明，该方法显著优于广泛使用的随机采样一致性（RANSAC）异常值检测器。为处理大形变，我们采用了两阶段图像配准框架：第一阶段使用单应性变换，第二阶段采用更精确的三阶多项式变换。该模型的有效性在三个视网膜图像数据集（彩色眼底图像、荧光素血管造影图像和激光散斑血流成像图像）上得到验证，在所有三个数据集中均优于当前最先进方法，尤其适用于具有大位移和尺度形变的图像对配准。这项创新为视网膜图像分析中的多种应用提供了可能。我们的代码已公开在https://github.com/mirthAI/RetinaRegNet。

相关内容

图像配准

关注 810

图像配准是图像处理研究领域中的一个典型问题和技术难点，其目的在于比较或融合针对同一对象在不同条件下获取的图像，例如图像会来自不同的采集设备，取自不同的时间，不同的拍摄视角等等，有时也需要用到针对不同对象的图像配准问题。具体地说，对于一组图像数据集中的两幅图像，通过寻找一种空间变换把一幅图像映射到另一幅图像，使得两图中对应于空间同一位置的点一一对应起来，从而达到信息融合的目的。该技术在计算机视觉、医学图像处理以及材料力学等领域都具有广泛的应用。根据具体应用的不同，有的侧重于通过变换结果融合两幅图像，有的侧重于研究变换本身以获得对象的一些力学属性。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日