This thesis is a corpus-based, quantitative, and typological analysis of the functions of Early Slavic participle constructions and their finite competitors ($jegda$-'when'-clauses). The first part leverages detailed linguistic annotation on Early Slavic corpora at the morphosyntactic, dependency, information-structural, and lexical levels to obtain indirect evidence for different potential functions of participle clauses and their main finite competitor and understand the roles of compositionality and default discourse reasoning as explanations for the distribution of participle constructions and $jegda$-clauses in the corpus. The second part uses massively parallel data to analyze typological variation in how languages express the semantic space of English $when$, whose scope encompasses that of Early Slavic participle constructions and $jegda$-clauses. Probabilistic semantic maps are generated and statistical methods (including Kriging, Gaussian Mixture Modelling, precision and recall analysis) are used to induce cross-linguistically salient dimensions from the parallel corpus and to study conceptual variation within the semantic space of the hypothetical concept WHEN.
翻译:本论文是一项基于语料库、量化及类型学的研究,旨在分析早期斯拉夫语分词结构及其有限从句竞争者($jegda$-‘当’-从句)的功能。第一部分利用早期斯拉夫语语料库在形态句法、依存关系、信息结构和词汇层面的详细语言学标注,获取分词结构及其主要有限从句竞争者潜在不同功能的间接证据,并理解组合性与默认话语推理在解释语料库中分词结构与$jegda$-从句分布中的作用。第二部分采用大规模平行数据分析语言如何表达英语$when$的语义空间——该语义空间的范畴涵盖了早期斯拉夫语分词结构与$jegda$-从句的范围。通过生成概率语义图,并运用统计方法(包括克里金法、高斯混合模型、精确率与召回率分析),从平行语料库中归纳跨语言显著维度,并研究假设概念WHEN语义空间内的概念变异。