This thesis is a corpus-based, quantitative, and typological analysis of the functions of Early Slavic participle constructions and their finite competitors ($jegda$-'when'-clauses). The first part leverages detailed linguistic annotation on Early Slavic corpora at the morphosyntactic, dependency, information-structural, and lexical levels to obtain indirect evidence for different potential functions of participle clauses and their main finite competitor and understand the roles of compositionality and default discourse reasoning as explanations for the distribution of participle constructions and $jegda$-clauses in the corpus. The second part uses massively parallel data to analyze typological variation in how languages express the semantic space of English $when$, whose scope encompasses that of Early Slavic participle constructions and $jegda$-clauses. Probabilistic semantic maps are generated and statistical methods (including Kriging, Gaussian Mixture Modelling, precision and recall analysis) are used to induce cross-linguistically salient dimensions from the parallel corpus and to study conceptual variation within the semantic space of the hypothetical concept WHEN.
翻译:本文基于语料库,采用量化与类型学方法,分析了早期斯拉夫语分词结构及其有限从句竞争者($jegda$-‘当’-从句)的功能。第一部分利用早期斯拉夫语语料库中形态句法、依存关系、信息结构与词汇层面的详细语言标注,获取分词从句及其主要有限从句竞争者潜在不同功能的间接证据,并理解组合性与默认话语推理在解释语料库中分词结构与$jegda$-从句分布中的作用。第二部分使用大规模平行语料,分析语言如何表达英语$when$的语义空间(该语义空间涵盖早期斯拉夫语分词结构与$jegda$-从句的范围)的类型学差异。通过生成概率语义地图,并运用统计方法(包括克里金法、高斯混合模型、精确率与召回率分析),从平行语料库中归纳跨语言显著维度,研究假设概念WHEN语义空间内的概念变异。