Early diagnosis of mental disorders, followed by timely intervention, can help prevent severe harm and improve treatment outcomes. Using social media and pre-trained language models, this study explores how user-generated data can be used to predict symptoms of mental disorders. We compare four BERT models from Hugging Face with the standard machine learning techniques used for automatic depression diagnosis in recent literature. The results show that the pre-trained models outperform the earlier approaches, reaching an accuracy of up to 97%. Our analysis complements past findings and shows that even small amounts of data (such as users' bio descriptions) have the potential to predict mental disorders. We conclude that social media data is an excellent source for mental health screening and that pre-trained models can effectively automate this critical task.