Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks. Prior research has shed light on the potential of advanced AI systems to be exploited for malicious purposes. However, we still lack a concrete understanding of how GenAI models are specifically exploited or abused in practice, including the tactics employed to inflict harm. In this paper, we present a taxonomy of GenAI misuse tactics, informed by existing academic literature and a qualitative analysis of approximately 200 observed incidents of misuse reported between January 2023 and March 2024. Through this analysis, we illuminate key and novel patterns in misuse during this time period, including potential motivations, strategies, and how attackers leverage and abuse system capabilities across modalities (e.g. image, text, audio, video) in the wild.
翻译:生成式多模态人工智能(GenAI)在各行业展现出变革性潜力,但其滥用行为带来了重大风险。现有研究揭示了高级人工智能系统可能被恶意利用的潜在风险,然而我们仍缺乏对GenAI模型在实践中的具体利用或滥用方式(包括造成危害的战术手段)的实质性理解。本文基于现有学术文献以及对2023年1月至2024年3月期间报告的约200起已观测滥用事件的定性分析,提出了GenAI滥用战术的分类体系。通过该分析,我们揭示了这一时期滥用行为的关键性与新颖性模式,包括潜在动机、实施策略,以及攻击者如何在实际场景中跨模态(如图像、文本、音频、视频)利用和滥用系统能力。