This thesis is a corpus-based, quantitative, and typological analysis of the functions of Early Slavic participle constructions and their finite competitors ($jegda$-'when'-clauses). The first part leverages detailed linguistic annotation on Early Slavic corpora at the morphosyntactic, dependency, information-structural, and lexical levels to obtain indirect evidence for different potential functions of participle clauses and their main finite competitor and understand the roles of compositionality and default discourse reasoning as explanations for the distribution of participle constructions and $jegda$-clauses in the corpus. The second part uses massively parallel data to analyze typological variation in how languages express the semantic space of English $when$, whose scope encompasses that of Early Slavic participle constructions and $jegda$-clauses. Probabilistic semantic maps are generated and statistical methods (including Kriging, Gaussian Mixture Modelling, precision and recall analysis) are used to induce cross-linguistically salient dimensions from the parallel corpus and to study conceptual variation within the semantic space of the hypothetical concept WHEN.
翻译:本论文是一项基于语料库的、对早期斯拉夫语分词结构及其有限竞争者($jegda$-“当”从句)功能的定量与类型学分析。第一部分利用早期斯拉夫语语料库在形态句法、依存关系、信息结构和词汇层面的精细语言标注,获取不同潜在功能的间接证据,以揭示分词结构及其主要有限竞争从句的功能差异,并理解组合性与默认话语推理在解释语料库中分词结构和$jegda$从句分布中的作用。第二部分采用大规模平行语料,分析不同语言在表达英语$when$(其语义范围涵盖早期斯拉夫语分词结构和$jegda$从句)语义空间时的类型学差异。通过生成概率语义图,并运用统计方法(包括克里金法、高斯混合模型、精确率与召回率分析),从平行语料中归纳跨语言显著维度,并研究假设概念WHEN语义空间内的概念变异。