Current large language models, such as OpenAI's ChatGPT, have captured the public's attention because how remarkable they are in the use of language. Here, I demonstrate that ChatGPT displays phonological biases that are a hallmark of human language processing. More concretely, just like humans, ChatGPT has a consonant bias. That is, the chatbot has a tendency to use consonants over vowels to identify words. This is observed across languages that differ in their relative distribution of consonants and vowels such as English and Spanish. Despite the differences in how current artificial intelligence language models are trained to process linguistic stimuli and how human infants acquire language, such training seems to be enough for the emergence of a phonological bias in ChatGPT
翻译:当前的大型语言模型,如OpenAI的ChatGPT,因其在语言运用方面的卓越表现而引起了公众的关注。本文中,我证明了ChatGPT表现出人类语言处理中特有的语音偏见。更具体地说,与人类类似,ChatGPT具有辅音偏见。也就是说,该聊天机器人倾向于使用辅音而非元音来识别单词。这一现象在英语和西班牙语等辅音与元音分布比例不同的语言中均被观察到。尽管当前人工智能语言模型处理语言刺激的训练方式与人类婴儿习得语言的过程存在差异,但此类训练似乎足以使ChatGPT产生语音偏见。