This study investigates whether division on political topics is mapped with the distinctive patterns of language use. We collect a total 145,832 Reddit comments on the abortion debate and explore the languages of subreddit communities r/prolife and r/prochoice. With consideration of the Moral Foundations Theory, we examine lexical patterns in three ways. First, we compute proportional frequencies of lexical items from the Moral Foundations Dictionary in order to make inferences about each group's moral considerations when forming arguments for and against abortion. We then create n-gram models to reveal frequent collocations from each stance group and better understand how commonly used words are patterned in their linguistic context and in relation to morality values. Finally, we use Latent Dirichlet Allocation to identify underlying topical structures in the corpus data. Results show that the use of morality words is mapped with the stances on abortion.
翻译:本研究探讨政治议题的分歧是否与语言使用的独特模式相关联。我们收集了总计145,832条关于堕胎辩论的Reddit评论,并分析了r/prolife与r/prochoice两个子社区的语言特征。基于道德基础理论,我们从三个维度考察词汇模式:首先,通过计算道德基础词典中词汇的比例频率,推断各群体在构建支持或反对堕胎论点时的道德考量;其次,构建n-gram模型以揭示各立场群体的高频搭配模式,深入理解常用词汇在其语言语境中如何与道德价值观形成关联;最后,运用潜在狄利克雷分布模型识别语料库中的潜在主题结构。结果表明,道德词汇的使用与堕胎立场存在系统性对应关系。