Image style transfer aims to integrate the visual patterns of a specific artistic style into a content image while preserving its content structure. Existing methods rely mainly on generative adversarial networks (GANs) or Stable Diffusion (SD). GAN-based approaches built on CNNs or Transformers struggle to capture local and global dependencies jointly, leading to artifacts and disharmonious patterns. SD-based methods reduce such issues but often fail to preserve content structure and suffer from slow inference. To address these issues, we revisit GANs and propose a Mamba-based generator, termed StyMam, that produces high-quality stylized images without introducing artifacts or disharmonious patterns. Specifically, the generator combines a residual dual-path strip scanning mechanism, which efficiently captures local texture features, with a channel-reweighted spatial attention module, which models global dependencies. Extensive qualitative and quantitative experiments demonstrate that the proposed method outperforms state-of-the-art algorithms in both quality and speed.
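The abstract does not specify how the channel-reweighted spatial attention module is implemented. Below is a minimal NumPy sketch of one common reading of that name, assuming a squeeze-and-excitation-style channel gate followed by plain spatial self-attention; the function name, the two projection matrices `w1` and `w2`, and the overall wiring are all illustrative assumptions, not the paper's actual design.

```python
import numpy as np

def channel_reweighted_spatial_attention(x, w1, w2):
    """Hypothetical sketch. x: feature map of shape (C, H, W).
    w1: (C//r, C) and w2: (C, C//r) are assumed gating projections."""
    C, H, W = x.shape

    # Channel reweighting (squeeze-and-excitation style):
    # global average pool -> bottleneck MLP -> sigmoid gates per channel.
    pooled = x.mean(axis=(1, 2))                                   # (C,)
    hidden = np.maximum(w1 @ pooled, 0.0)                          # ReLU bottleneck
    gates = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))                   # (C,) in (0, 1)
    xc = x * gates[:, None, None]                                  # reweighted channels

    # Spatial self-attention over the reweighted features:
    # each of the H*W positions attends to every other position,
    # which is what lets the module model global dependencies.
    tokens = xc.reshape(C, H * W)                                  # (C, N)
    scores = tokens.T @ tokens / np.sqrt(C)                        # (N, N) similarities
    scores -= scores.max(axis=-1, keepdims=True)                   # stable softmax
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)                       # rows sum to 1
    out = (tokens @ attn.T).reshape(C, H, W)                       # aggregate globally
    return out
```

Because the attention matrix is dense over all `H * W` positions, every output location mixes information from the whole feature map, in contrast to the strip scanning path, which (per the abstract) handles local texture.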