The moral value of liberty is a central concept in our inference system when we take a stance on controversial social issues such as vaccine hesitancy, climate change, or the right to abortion. Here, we propose a novel Liberty lexicon, evaluated on more than 3,000 manually annotated data points in both in-domain and out-of-domain scenarios. From this evaluation we produce a combined lexicon, which is the main outcome of this work. The final lexicon incorporates information from an ensemble of lexicons generated using word embedding similarity (WE) and compositional semantics (CS). Our key contributions are enriching the liberty annotations, developing a robust liberty lexicon for broader application, and revealing the complexity of liberty-related expressions across different platforms. The evaluation shows that the difficulty of the task calls for approaches that combine knowledge in an effort to improve the representations of learning systems.