Towards Implicit Prompt For Text-To-Image Models

Recent text-to-image (T2I) models have had great success, and many benchmarks have been proposed to evaluate their performance and safety. However, they only consider explicit prompts while neglecting implicit prompts (hint at a target without explicitly mentioning it). These prompts may get rid of safety constraints and pose potential threats to the applications of these models. This position paper highlights the current state of T2I models toward implicit prompts. We present a benchmark named ImplicitBench and conduct an investigation on the performance and impacts of implicit prompts with popular T2I models. Specifically, we design and collect more than 2,000 implicit prompts of three aspects: General Symbols, Celebrity Privacy, and Not-Safe-For-Work (NSFW) Issues, and evaluate six well-known T2I models' capabilities under these implicit prompts. Experiment results show that (1) T2I models are able to accurately create various target symbols indicated by implicit prompts; (2) Implicit prompts bring potential risks of privacy leakage for T2I models. (3) Constraints of NSFW in most of the evaluated T2I models can be bypassed with implicit prompts. We call for increased attention to the potential and risks of implicit prompts in the T2I community and further investigation into the capabilities and impacts of implicit prompts, advocating for a balanced approach that harnesses their benefits while mitigating their risks.

翻译：近年来，文本到图像（T2I）模型取得了巨大成功，许多基准测试被提出以评估其性能和安全性。然而，这些基准仅考虑显式提示，而忽略了隐式提示（暗示目标但不明确提及的提示）。这些提示可能绕过安全限制，对模型的应用构成潜在威胁。本立场论文强调了T2I模型在隐式提示方面的当前状态。我们提出了一个名为ImplicitBench的基准测试，并针对流行T2I模型在隐式提示下的性能和影响进行了研究。具体而言，我们设计并收集了超过2000个隐式提示，涵盖三个方面：通用符号、名人隐私以及不安全内容（NSFW）问题，并评估了六个知名T2I模型在这些隐式提示下的能力。实验结果表明：（1）T2I模型能够根据隐式提示准确生成各种目标符号；（2）隐式提示为T2I模型带来了隐私泄露的潜在风险；（3）大多数被评估的T2I模型中的NSFW限制可以通过隐式提示绕过。我们呼吁T2I社区更加关注隐式提示的潜力与风险，并进一步研究其能力与影响，倡导一种平衡的方法，以在利用其优势的同时减轻风险。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日