Generating novel experimental hypotheses from language models: A case study on cross-dative generalization

Neural network language models (LMs) have been shown to capture complex linguistic knowledge, but their utility for understanding language acquisition is still debated. We contribute to this debate with a case study in which LMs serve as simulated learners for deriving novel experimental hypotheses to be tested with humans. We apply this paradigm to cross-dative generalization (CDG): the productive generalization of novel verbs across dative constructions (she pilked me the ball / she pilked the ball to me), whose acquisition is known to involve a large space of contextual features. Using LMs trained on child-directed speech, we ask: "What properties of the training exposure facilitate a novel verb's generalization to the (unmodeled) alternate construction?" To answer this, we systematically vary the exposure context in which a novel dative verb occurs in terms of the properties of the theme and recipient, and then analyze the LMs' usage of the novel verb in the unmodeled dative construction. As a precondition to exploring novel hypotheses, we first show that the LMs replicate known patterns of children's CDG. Subsequent simulations reveal a nuanced role of the features of the novel verb's exposure context in the LMs' CDG: generalization is facilitated when the first postverbal argument of the exposure context is pronominal, definite, short, and conforms to the prototypical animacy expectations of the exposure dative. These patterns are characteristic of harmonic alignment in datives, where the argument with features ranking higher on the discourse prominence scale tends to precede the other. This gives rise to a novel hypothesis: CDG is facilitated insofar as the features of the exposure context, in particular its first postverbal argument, are harmonically aligned. We conclude by proposing future experiments that can test this hypothesis in children.
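
The exposure manipulation described in the abstract (crossing properties of the theme and recipient in the sentence that introduces the novel verb) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the argument inventories, the construction labels `DO` and `PP`, the two-word threshold for "short", and the additive `alignment_score` are all hypothetical simplifications introduced here for illustration. The sketch also assumes that the first postverbal argument is prototypically animate in the double-object dative (the recipient) and inanimate in the prepositional dative (the theme). Training and querying the LMs is omitted.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Argument:
    """A theme or recipient phrase with hand-annotated features."""
    text: str
    pronominal: bool
    definite: bool
    animate: bool

    @property
    def length(self) -> int:
        # Length in whitespace-separated words.
        return len(self.text.split())


# Hypothetical inventories; a real study would use a much larger, controlled set.
RECIPIENTS = [
    Argument("me", pronominal=True, definite=True, animate=True),
    Argument("the girl", pronominal=False, definite=True, animate=True),
    Argument("a store down the road", pronominal=False, definite=False, animate=False),
]
THEMES = [
    Argument("it", pronominal=True, definite=True, animate=False),
    Argument("the ball", pronominal=False, definite=True, animate=False),
    Argument("a puppy from the farm", pronominal=False, definite=False, animate=True),
]


def exposure_sentence(verb, construction, theme, recipient, agent="she"):
    """Render an exposure sentence in the double-object (DO) or prepositional (PP) dative."""
    if construction == "DO":
        return f"{agent} {verb} {recipient.text} {theme.text}"
    if construction == "PP":
        return f"{agent} {verb} {theme.text} to {recipient.text}"
    raise ValueError(f"unknown construction: {construction}")


def alignment_score(construction, theme, recipient, short_max=2):
    """Count how many of the four facilitating properties hold for the
    first postverbal argument (assumed scoring scheme, range 0 to 4)."""
    first = recipient if construction == "DO" else theme
    expected_animate = construction == "DO"  # assumption stated in the lead-in
    return sum([
        first.pronominal,
        first.definite,
        first.length <= short_max,
        first.animate == expected_animate,
    ])


def build_exposures(verb="pilked"):
    """Cross both constructions with every theme and recipient."""
    rows = []
    for construction, theme, recipient in product(("DO", "PP"), THEMES, RECIPIENTS):
        rows.append({
            "construction": construction,
            "sentence": exposure_sentence(verb, construction, theme, recipient),
            "alignment": alignment_score(construction, theme, recipient),
        })
    return rows


if __name__ == "__main__":
    for row in build_exposures():
        print(row["construction"], row["alignment"], row["sentence"])
```

Under the hypothesis stated in the abstract, exposure sentences with a higher score would be expected to yield stronger use of the novel verb in the construction that was not modeled during exposure. Whether the four properties contribute equally, as this additive score assumes, is not something the abstract specifies.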
