Multi-class Regret Detection in Hindi Devanagari Script

The number of Hindi speakers on social media has increased dramatically in recent years. Regret is a common emotional experience in our everyday life. Many speakers on social media, share their regretful experiences and opinions regularly. It might cause a re-evaluation of one's choices and a desire to make a different option if given the chance. As a result, knowing the source of regret is critical for investigating its impact on behavior and decision-making. This study focuses on regret and how it is expressed, specifically in Hindi, on various social media platforms. In our study, we present a novel dataset from three different sources, where each sentence has been manually classified into one of three classes "Regret by action", "Regret by inaction", and "No regret". Next, we use this dataset to investigate the linguistic expressions of regret in Hindi text and also identify the textual domains that are most frequently associated with regret. Our findings indicate that individuals on social media platforms frequently express regret for both past inactions and actions, particularly within the domain of interpersonal relationships. We use a pre-trained BERT model to generate word embeddings for the Hindi dataset and also compare deep learning models with conventional machine learning models in order to demonstrate accuracy. Our results show that BERT embedding with CNN consistently surpassed other models. This described the effectiveness of BERT for conveying the context and meaning of words in the regret domain.

翻译：近年来，社交媒体上印地语使用者的数量急剧增加。悔恨是我们日常生活中常见的情感体验。许多社交媒体用户经常分享他们的悔恨经历和观点。这可能导致对自己选择的重新评估，以及若有機會可能做出不同选择的愿望。因此，了解悔恨的根源对于研究其对行为和决策的影响至关重要。本研究聚焦于悔恨及其表达方式，特别是在各种社交媒体平台上的印地语表达。在我们的研究中，我们提出了一个来自三个不同来源的新数据集，其中每个句子被人工分类为三类之一：“因行动而悔恨”、“因不作为而悔恨”和“无悔恨”。接下来，我们利用该数据集研究印地语文本中悔恨的语言表达方式，并识别出最常与悔恨相关的文本领域。我们的研究结果表明，社交媒体平台上的个体经常表达对过去不作为和行动的悔恨，尤其是在人际关系领域。我们使用预训练的BERT模型为印地语数据集生成词嵌入，并比较深度学习模型与传统机器学习模型以展示准确性。我们的结果表明，基于CNN的BERT嵌入始终优于其他模型。这揭示了BERT在传达悔恨领域中词语上下文和含义方面的有效性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日