Theory of Mind (ToM) is the ability to understand human thinking and decision-making, an ability that plays a crucial role in social interaction between people, including linguistic communication. This paper investigates to what extent recent Large Language Models in the ChatGPT tradition possess ToM. We posed six well-known problems that address biases in human reasoning and decision making to two versions of ChatGPT and we compared the results under a range of prompting strategies. While the results concerning ChatGPT-3 were somewhat inconclusive, ChatGPT-4 was shown to arrive at the correct answers more often than would be expected based on chance, although correct answers were often arrived at on the basis of false assumptions or invalid reasoning.
翻译:心智理论(Theory of Mind, ToM)是指理解人类思维与决策的能力,这一能力在人际社交互动(包括语言交流)中起着关键作用。本文探究了基于ChatGPT系列的最新大语言模型在多大程度上具备心智理论。我们选取了六个涉及人类推理与决策偏差的经典问题,对两个版本的ChatGPT进行测试,并在多种提示策略下比较其结果。尽管关于ChatGPT-3的结果尚不明确,但ChatGPT-4得出正确答案的频率显著高于随机水平,不过其正确结论往往建立在错误假设或无效推理之上。