``Theory of Mind" (ToM) is the ability to understand human thinking and decision-making, an ability that plays a crucial role in many types of social interaction between people, including linguistic communication. This paper investigates to what extent recent Large Language Models in the ChatGPT tradition possess ToM. Focussing on six well-known ToM problems, we posed each problem to two versions of ChatGPT and compared the results under a range of prompting strategies. While the results concerning ChatGPT-3 were somewhat inconclusive, ChatGPT-4 was shown to arrive at the correct answers more often than would be expected based on chance, although correct answers were often arrived at on the basis of false assumptions or invalid reasoning.
翻译:“心理理论”(Theory of Mind, ToM)是指理解人类思维和决策的能力,这种能力在人际社会互动(包括语言交流)中扮演着关键角色。本文研究了基于ChatGPT框架的近期大语言模型在多大程度上具备心理理论。我们以六个经典心理理论问题为测试内容,向两个版本的ChatGPT分别提出这些问题,并在多种提示策略下对结果进行了比较。虽然关于ChatGPT-3的结果尚不确定,但ChatGPT-4在多数情况下能比随机水平更频繁地得出正确答案,不过这些正确答案往往基于错误假设或无效推理得出。