Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics

TripAdvisor reviews and comparable data sources play an important role in many tasks in Natural Language Processing (NLP), providing a data basis for the identification and classification of subjective judgments, such as hotel or restaurant reviews, into positive or negative polarities. This study explores three important factors influencing variation in crowdsourced polarity judgments, focusing on TripAdvisor reviews in Spanish. Three hypotheses are tested: the role of Part Of Speech (POS), the impact of sentiment words such as "tasty", and the influence of neutral words like "ok" on judgment variation. The study's methodology employs one-word titles, demonstrating their efficacy in studying polarity variation of words. Statistical tests on mean equality are performed on word groups of our interest. The results of this study reveal that adjectives in one-word titles tend to result in lower judgment variation compared to other word types or POS. Sentiment words contribute to lower judgment variation as well, emphasizing the significance of sentiment words in research on polarity judgments, and neutral words are associated with higher judgment variation as expected. However, these effects cannot be always reproduced in longer titles, which suggests that longer titles do not represent the best data source for testing the ambiguity of single words due to the influence on word polarity by other words like negation in longer titles. This empirical investigation contributes valuable insights into the factors influencing polarity variation of words, providing a foundation for NLP practitioners that aim to capture and predict polarity judgments in Spanish and for researchers that aim to understand factors influencing judgment variation.

翻译：TripAdvisor评论及类似数据源在自然语言处理（NLP）的诸多任务中扮演着重要角色，为识别和分类主观判断（如酒店或餐厅评论的正负面倾向）提供了数据基础。本研究以西班牙语TripAdvisor评论为焦点，探讨影响众包倾向性判断变异的三大关键因素。我们对三个假设进行验证：词性标注（POS）的作用、情感词（如“美味”）的影响，以及中性词（如“还行”）对判断变异的作用。研究方法采用单词标题，证实了其在研究词汇倾向性变异中的有效性，并对感兴趣词群进行了均值相等性统计检验。结果显示：单词标题中形容词导致的判断变异低于其他词类或词性；情感词同样降低了判断变异，凸显了情感词在倾向性判断研究中的重要性；中性词则如预期般与更高判断变异相关。然而，这些效应在较长标题中并非始终可复现，表明较长标题因否定词等其他词汇对词义倾向的影响，并非测试单词歧义的最佳数据源。本实证研究为理解词汇倾向性变异的影响因素提供了重要洞见，为旨在捕捉和预测西班牙语倾向性判断的NLP从业者，以及致力于探究判断变异影响机制的研究者奠定了坚实基础。