Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution

Deep models have achieved significant progress on single image super-resolution (SISR), in particular large models with large kernels ($3\times3$ or larger). However, the heavy computational footprint of such models prevents their deployment in real-time, resource-constrained environments. Conversely, $1\times1$ convolutions are substantially more computationally efficient, but struggle to aggregate local spatial representations, an essential capability for SISR models. In response to this dichotomy, we propose to harmonize the merits of both $3\times3$ and $1\times1$ kernels and exploit their potential for lightweight SISR. Specifically, we propose a simple yet effective fully $1\times1$ convolutional network, named Shift-Conv-based Network (SCNet). By incorporating a parameter-free spatial-shift operation, SCNet equips the fully $1\times1$ convolutional network with powerful representation capability while retaining impressive computational efficiency. Extensive experiments demonstrate that SCNet, despite its fully $1\times1$ convolutional structure, consistently matches or even surpasses the performance of existing lightweight SR models that employ regular convolutions. The code and pre-trained models can be found at https://github.com/Aitical/SCNet.
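To make the idea of a parameter-free spatial shift followed by a $1\times1$ convolution concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how channels are shifted, so the nine-offset pattern `OFFSETS`, the zero filling at the borders, and the function names `spatial_shift`, `pointwise_conv`, `shift_conv`, and `conv_params` are all assumptions made for illustration. The sketch shows that once channel groups are displaced in different directions, a per-pixel linear map over channels sees a $3\times3$ neighbourhood while using one ninth of the weights of a $3\times3$ convolution with the same channel counts.

```python
import numpy as np

# Hypothetical shift pattern: one (dy, dx) offset per channel group, covering
# the 3x3 neighbourhood. The abstract does not specify this; it is an assumption.
OFFSETS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]


def spatial_shift(x, offsets=OFFSETS):
    """Shift channel groups of x (shape C, H, W) by different offsets.

    Group g is filled so that out[c, i, j] = x[c, i + dy, j + dx];
    positions whose source falls outside the image are left at zero.
    The operation has no learnable parameters.
    """
    c, h, w = x.shape
    out = np.zeros_like(x)
    groups = np.array_split(np.arange(c), len(offsets))
    for idx, (dy, dx) in zip(groups, offsets):
        if idx.size == 0:
            continue  # fewer channels than offsets: skip empty groups
        dst_y = slice(max(0, -dy), h - max(0, dy))
        src_y = slice(max(0, dy), h - max(0, -dy))
        dst_x = slice(max(0, -dx), w - max(0, dx))
        src_x = slice(max(0, dx), w - max(0, -dx))
        out[idx, dst_y, dst_x] = x[idx, src_y, src_x]
    return out


def pointwise_conv(x, weight, bias=None):
    """1x1 convolution: a per-pixel linear map over channels.

    x has shape (C_in, H, W); weight has shape (C_out, C_in).
    """
    y = np.einsum("oc,chw->ohw", weight, x)
    if bias is not None:
        y = y + bias[:, None, None]
    return y


def shift_conv(x, weight, bias=None):
    """Spatial shift followed by a 1x1 convolution (illustrative only)."""
    return pointwise_conv(spatial_shift(x), weight, bias)


def conv_params(c_in, c_out, k):
    """Weight count of a k x k convolution, ignoring the bias."""
    return c_in * c_out * k * k


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((18, 8, 8))   # 18 channels, 8x8 feature map
    w = rng.standard_normal((18, 18))     # 1x1 kernel as a (C_out, C_in) matrix
    y = shift_conv(x, w)
    print(y.shape)
    print(conv_params(18, 18, 3) // conv_params(18, 18, 1))
```

Under these assumptions, feeding an impulse through `shift_conv` spreads it over a $3\times3$ window, which is one way to see how a shift can restore the local aggregation that a plain $1\times1$ convolution lacks. The actual shift layout and block design used by SCNet should be taken from the linked repository.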

