The aim of this paper is to propose a mechanism to efficiently and explicitly model image hierarchies in the global, regional, and local range for image restoration. To achieve that, we start by analyzing two important properties of natural images including cross-scale similarity and anisotropic image features. Inspired by that, we propose the anchored stripe self-attention which achieves a good balance between the space and time complexity of self-attention and the modelling capacity beyond the regional range. Then we propose a new network architecture dubbed GRL to explicitly model image hierarchies in the Global, Regional, and Local range via anchored stripe self-attention, window self-attention, and channel attention enhanced convolution. Finally, the proposed network is applied to 7 image restoration types, covering both real and synthetic settings. The proposed method sets the new state-of-the-art for several of those. Code will be available at https://github.com/ofsoundof/GRL-Image-Restoration.git.
翻译:本文旨在提出一种机制,用于在全局、区域和局部范围内高效显式地建模图像层次结构以实现图像恢复。为此,我们首先分析自然图像的两个重要特性,包括跨尺度相似性和各向异性图像特征。受此启发,我们提出锚定条状自注意力机制,它在自注意力的空间与时间复杂度以及超出区域范围的建模能力之间取得了良好平衡。接着,我们提出名为GRL的新型网络架构,通过锚定条状自注意力、窗口自注意力和通道注意力增强卷积,在全局、区域和局部范围内显式建模图像层次结构。最后,将所提网络应用于7种图像恢复类型,涵盖真实和合成场景。所提方法在多项任务上创下新的最优性能。代码将发布于 https://github.com/ofsoundof/GRL-Image-Restoration.git。