Here we examine how AI agent "personalities" interact with human personalities to shape human-AI collaboration and performance. In a large-scale, preregistered randomized experiment, we paired 1,258 participants with AI agents prompted to exhibit varying levels of the Big Five personality traits. These human-AI teams produced 7,266 display ads for a real think tank, which we evaluated using 1,995 independent human raters and a field experiment on X that generated nearly 5 million impressions. We found that human and AI personalities individually shaped ad quality and teamwork. When examined together, human-AI personality pairings directly effected ad quality outcomes. For example, extraverted humans paired with conscientious AI produced the lowest-quality ads, followed by conscientious humans paired with agreeable AI and neurotic humans paired with conscientious AI. In the field experiment, ad quality significantly influenced ad performance, measured by click-through rates and cost-per-click, and neurotic humans paired with neurotic AI achieved higher click-through rates, even after controlling for ad quality. Together, these results provide the first large-scale causal experimental evidence that specific personality pairings can improve human-AI collaboration and motivate future research on the implications of AI personalization for performance and teamwork dynamics in human-AI teams.
翻译:本研究通过大规模预注册随机实验,探究AI智能体的“人格特质”如何与人类人格相互作用,进而影响人机协作效能。我们为1258名参与者匹配了经提示呈现不同大五人格特质的AI智能体,这些人机团队为真实智库生成了7266条展示广告,并通过1995名独立人类评估者及在X平台上开展的田野实验(累计产生近500万次曝光)进行评估。研究发现,人类与AI的人格特质分别对广告质量与团队协作效能产生独立影响。当将两者联合分析时,人机人格匹配模式直接作用于广告质量——例如,外向人类与尽责AI的配对产出质量最低的广告,其次为尽责人类与宜人AI的配对、神经质人类与尽责AI的配对。田野实验表明,广告质量显著影响点击率与单次点击成本等绩效指标,且在控制广告质量后,神经质人类与神经质AI的配对仍能获得更高点击率。这些结果首次为大规模因果实验证据所证实——特定人格匹配模式可优化人机协作效能,并为探究AI个性化对人机团队绩效与协作动力学的启示提供了研究方向。