As large language models (LLMs) are increasingly used in morally sensitive domains, it is crucial to understand how persona traits affect their moral reasoning and persuasive behavior. We present the first large-scale study of multi-dimensional persona effects in AI-AI debates over real-world moral dilemmas. Using a 6-dimensional persona space (age, gender, country, class, ideology, and personality), we simulate structured debates between AI agents over 131 relationship-based cases. Our results show that personas affect initial moral stances and debate outcomes, with political ideology and personality traits exerting the strongest influence. Persuasive success varies across traits, with liberal and open personalities reaching higher consensus and win rates. While logit-based confidence grows during debates, emotional and credibility-based appeals diminish, indicating more tempered argumentation over time. These trends mirror findings from psychology and cultural studies, reinforcing the need for persona-aware evaluation frameworks for AI moral reasoning.