Languages may encode similar meanings using different sentence structures. This makes it a challenge to provide a single set of formal rules that can derive meanings from sentences in many languages at once. To overcome the challenge, we can take advantage of language-general connections between meaning and syntax, and build on cross-linguistically parallel syntactic structures. We introduce UD Type Calculus, a compositional, principled, and language-independent system of semantic types and logical forms for lexical items which builds on a widely-used language-general dependency syntax framework. We explain the essential features of UD Type Calculus, which all involve giving dependency relations denotations just like those of words. These allow UD-TC to derive correct meanings for sentences with a wide range of syntactic structures by making use of dependency labels. Finally, we present evaluation results on a large existing corpus of sentences and their logical forms, showing that UD-TC can produce meanings comparable with our baseline.
翻译:语言可能使用不同的句子结构编码相似含义。这给同时从多种语言的句子推导出含义的单一形式规则集带来了挑战。为克服这一挑战,我们可以利用意义与句法之间的语言通用关联,并基于跨语言平行的句法结构。我们引入UD类型演算(UD Type Calculus)——一个基于广泛使用的语言通用依存句法框架、具有组合性、原则性且语言无关的语义类型与逻辑形式系统。我们阐释UD类型演算的核心特性,这些特性均涉及赋予依存关系以类似词汇的指称。通过利用依存标签,UD-TC能够为具有广泛句法结构的句子推导出正确含义。最后,我们基于包含句子及其逻辑形式的大型现有语料库进行评测,结果表明UD-TC可生成与基线模型相当的含义。