Over the past decade, studies of naturalistic language processing where participants are scanned while listening to continuous text have flourished. Using word embeddings at first, then large language models, researchers have created encoding models to analyze the brain signals. Presenting these models with the same text as the participants allows to identify brain areas where there is a significant correlation between the functional magnetic resonance imaging (fMRI) time series and the ones predicted by the models' artificial neurons. One intriguing finding from these studies is that they have revealed highly symmetric bilateral activation patterns, somewhat at odds with the well-known left lateralization of language processing. Here, we report analyses of an fMRI dataset where we manipulate the complexity of large language models, testing 28 pretrained models from 8 different families, ranging from 124M to 14.2B parameters. First, we observe that the performance of models in predicting brain responses follows a scaling law, where the fit with brain activity increases linearly with the logarithm of the number of parameters of the model (and its performance on natural language processing tasks). Second, we show that a left-right asymmetry gradually appears as model size increases, and that the difference in left-right brain correlations also follows a scaling law. Whereas the smallest models show no asymmetry, larger models fit better and better left hemispheric activations than right hemispheric ones. This finding reconciles computational analyses of brain activity using large language models with the classic observation from aphasic patients showing left hemisphere dominance for language.
翻译:过去十年间,自然语言处理的神经影像研究蓬勃发展,这类研究让参与者在聆听连续文本的同时接受脑部扫描。研究者最初使用词嵌入技术,随后采用大型语言模型构建编码模型来分析大脑信号。通过向这些模型输入与参与者相同的文本,可以识别出功能磁共振成像(fMRI)时间序列与模型人工神经元预测序列存在显著相关性的脑区。这些研究揭示了一个引人注目的发现:它们呈现出高度对称的双侧激活模式,这与语言处理具有明确左半球优势的经典认知存在一定矛盾。本研究通过操控大型语言模型的复杂度,对fMRI数据集进行分析,测试了来自8个不同架构系列的28个预训练模型,参数量范围从1.24亿到142亿不等。首先,我们发现模型预测大脑反应的能力遵循缩放定律:模型与大脑活动的拟合度随参数数量(及其在自然语言处理任务上的性能)的对数线性增长。其次,我们证明随着模型规模增大,左右半球不对称性逐渐显现,且左右脑相关性的差异同样遵循缩放定律。最小模型未表现出不对称性,而较大模型对左半球激活的拟合度持续优于右半球。这一发现弥合了基于大型语言模型的脑活动计算分析与失语症患者研究之间的分歧,后者历来表明左半球在语言功能中占主导地位。