BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models

Despite the remarkable progress of diffusion models in image generation, recent studies reveal their vulnerability to backdoor attacks via covert visual or textual triggers. Although evolving defense mechanisms can detect most existing threats through visual inspection or feature analysis, we introduce BadBlocks-a novel, lightweight, and highly covert attack that challenges these safeguards. By selectively poisoning specific blocks within the UNet architecture while keeping other components intact, BadBlocks requires only 30% of the computational resources and 20% of the GPU time of conventional attacks, effectively democratizing backdoor injection on consumer-grade GPUs. Empirical evaluations demonstrate that BadBlocks achieves a high attack success rate with negligible perceptual quality loss, while successfully bypassing state-of-the-art defenses, particularly attention-based detection frameworks. Layer-level ablation studies further confirm that backdoor mapping does not require full-network fine-tuning, revealing the disparate vulnerability of different neural layers. Overall, BadBlocks significantly lowers the barrier for executing backdoor attacks, presenting a critical security risk. Our code is available at: https://github.com/paoche11/BadBlocks.

翻译：尽管扩散模型在图像生成领域取得了显著进展，但近期研究表明，其易受到通过隐蔽视觉或文本触发器的后门攻击。虽然不断演进的防御机制可通过视觉检测或特征分析识别大多数现有威胁，我们提出了BadBlocks——一种新颖、轻量级且高度隐蔽的攻击方法，用以挑战这些防护措施。通过选择性污染UNet架构中的特定模块，同时保持其他组件完好无损，BadBlocks仅需传统攻击30%的计算资源和20%的GPU时间，有效实现了在消费级GPU上进行后门注入的普及化。实验评估表明，BadBlocks在攻击成功率高且感知质量损失可忽略不计的同时，成功绕过了最先进的防御技术，尤其是基于注意力的检测框架。逐层消融研究进一步证实，后门映射无需全网络微调，揭示出不同神经层在脆弱性上的差异性。总体而言，BadBlocks显著降低了执行后门攻击的门槛，构成了重大的安全风险。我们的代码位于：https://github.com/paoche11/BadBlocks。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

计算机视觉领域的后门攻击与防御：综述

专知会员服务

20+阅读 · 2025年9月13日

深度学习中的架构后门：漏洞、检测与防御综述

专知会员服务

12+阅读 · 2025年7月19日

面向深度学习的后门攻击及防御研究综述

专知会员服务

13+阅读 · 2025年7月4日

【CVPR2025】BadToken：针对多模态大语言模型的词元级后门攻击

专知会员服务

10+阅读 · 2025年3月22日