Current large language models, such as OpenAI's ChatGPT, have captured the public's attention because of how remarkably well they use language. Here, I demonstrate that ChatGPT displays phonological biases that are a hallmark of human language processing. More concretely, just like humans, ChatGPT has a consonant bias: the chatbot tends to rely on consonants over vowels to identify words. This bias is observed across languages that differ in their relative distribution of consonants and vowels, such as English and Spanish. Despite the differences between how current artificial intelligence language models are trained to process linguistic stimuli and how human infants acquire language, such training seems to be enough for a phonological bias to emerge in ChatGPT.