This work is intended as a voice in the discussion over the recent claims that LaMDA, a pretrained language model based on the Transformer model architecture, is sentient. This claim, if confirmed, would have serious ramifications in the Natural Language Processing (NLP) community due to wide-spread use of similar models. However, here we take the position that such a language model cannot be sentient, or conscious, and that LaMDA in particular exhibits no advances over other similar models that would qualify it. We justify this by analysing the Transformer architecture through Integrated Information Theory. We see the claims of consciousness as part of a wider tendency to use anthropomorphic language in NLP reporting. Regardless of the veracity of the claims, we consider this an opportune moment to take stock of progress in language modelling and consider the ethical implications of the task. In order to make this work helpful for readers outside the NLP community, we also present the necessary background in language modelling.
翻译:本文旨在就近期关于LaMDA(一种基于Transformer模型架构的预训练语言模型)具有感知能力的说法进行讨论。若此说法得到证实,由于类似模型的广泛使用,将对自然语言处理(NLP)领域产生深远影响。然而,本文持相反立场:此类语言模型不具备感知能力或意识,且LaMDA相较于其他同类模型并无实质性进步。我们通过整合信息论分析Transformer架构来论证这一观点。我们认为,关于意识的主张是NLP报告中使用拟人化语言这一普遍倾向的体现。无论这些主张是否属实,我们认为此刻正是评估语言建模进展并思考该任务伦理影响的恰当时机。为使本文对NLP领域外的读者也有助益,我们同时提供了语言建模的必要背景知识。