Prompting Generative AI with Interaction-Augmented Instructions

The emergence of generative AI (GenAI) models, including large language models and text-to-image models, has significantly advanced human-AI synergy, not only through their outstanding capabilities but, more importantly, through intuitive communication via text prompts. Though intuitive, text-based instructions suffer from the ambiguity and redundancy of natural language. To address this issue, researchers have explored augmenting text-based instructions with interactions, such as direct manipulation, that facilitate precise and effective expression of human intent. However, the design strategies for interaction-augmented instructions lack systematic investigation, which hinders their understanding and application. To provide a panorama of interaction-augmented instructions, we propose a framework that analyzes related tools in terms of why, when, who, what, and how interactions are applied to augment text-based instructions. Notably, we identify four purposes for applying interactions: restricting, expanding, organizing, and refining text instructions. We also summarize the design paradigms for each purpose to benefit future researchers and practitioners.

