Calibration of Transformer-based Models for Identifying Stress and Depression in Social Media

In today's fast-paced world, the rates of stress and depression present a surge. Social media provide assistance for the early detection of mental health conditions. Existing methods mainly introduce feature extraction approaches and train shallow machine learning classifiers. Other researches use deep neural networks or transformers. Despite the fact that transformer-based models achieve noticeable improvements, they cannot often capture rich factual knowledge. Although there have been proposed a number of studies aiming to enhance the pretrained transformer-based models with extra information or additional modalities, no prior work has exploited these modifications for detecting stress and depression through social media. In addition, although the reliability of a machine learning model's confidence in its predictions is critical for high-risk applications, there is no prior work taken into consideration the model calibration. To resolve the above issues, we present the first study in the task of depression and stress detection in social media, which injects extra linguistic information in transformer-based models, namely BERT and MentalBERT. Specifically, the proposed approach employs a Multimodal Adaptation Gate for creating the combined embeddings, which are given as input to a BERT (or MentalBERT) model. For taking into account the model calibration, we apply label smoothing. We test our proposed approaches in three publicly available datasets and demonstrate that the integration of linguistic features into transformer-based models presents a surge in the performance. Also, the usage of label smoothing contributes to both the improvement of the model's performance and the calibration of the model. We finally perform a linguistic analysis of the posts and show differences in language between stressful and non-stressful texts, as well as depressive and non-depressive posts.

翻译：在当今快节奏的世界中，压力与抑郁的发病率呈现上升趋势。社交媒体为心理健康状况的早期检测提供了辅助手段。现有方法主要引入特征提取技术并训练浅层机器学习分类器，另一些研究则采用深度神经网络或Transformer。尽管基于Transformer的模型取得了显著改进，但它们往往无法捕捉丰富的事实知识。虽然已有诸多研究致力于通过额外信息或附加模态来增强预训练的Transformer模型，但尚无前期工作利用这些改进手段通过社交媒体检测压力与抑郁。此外，尽管机器学习模型对其预测置信度的可靠性对于高风险应用至关重要，但此前尚无研究考虑模型校准问题。为解决上述问题，我们首次在社交媒体的抑郁与压力检测任务中，向基于Transformer的模型（即BERT和MentalBERT）注入额外的语言信息。具体而言，所提出的方法采用多模态适应门来生成组合嵌入，并将其作为BERT（或MentalBERT）模型的输入。为考虑模型校准，我们应用标签平滑技术。我们在三个公开数据集上测试所提出的方法，并证明将语言特征集成到基于Transformer的模型中能显著提升性能。同时，标签平滑的使用有助于改善模型性能并提升校准效果。最后，我们对帖文进行语言学分析，展示压力与非压力文本、抑郁与非抑郁帖文之间的语言差异。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日