Text-to-image diffusion models can generate realistic images from textual inputs, enabling users to convey their ideas visually through language. Within language, emotion plays a crucial role in expressing personal opinions in daily life, and maliciously negative content can lead users astray and exacerbate negative emotions. Recognizing the success of diffusion models and the significance of emotion, we investigate a previously overlooked risk of text-to-image diffusion models: exploiting the emotion in input texts to introduce negative content and provoke unfavorable emotions in users. Specifically, we identify a new backdoor attack, the emotion-aware backdoor attack (EmoAttack), which introduces malicious negative content into generated images when triggered by emotional texts. We formulate this attack as a diffusion personalization problem to avoid extensive model retraining and propose EmoBooth. Unlike existing personalization methods, our approach fine-tunes a pre-trained diffusion model by establishing a mapping between a cluster of emotional words and a given reference image containing malicious negative content. To validate our method, we build a dataset and conduct extensive analysis and discussion of the attack's effectiveness. Given consumers' widespread use of diffusion models, uncovering this threat is critical for society.