Large language models (LLMs) are increasingly applied in mental health support systems, where reliable recognition of high-risk states such as suicidal ideation and self-harm is safety-critical. However, existing evaluations rely primarily on aggregate performance metrics, which often obscure risk-specific failure modes and offer limited insight into model behavior in realistic, multi-turn interactions. We present MHDash, an open-source platform designed to support the development, evaluation, and auditing of AI systems for mental health applications. MHDash integrates data collection, structured annotation, multi-turn dialogue generation, and baseline evaluation into a unified pipeline. The platform supports annotation along multiple dimensions, including Concern Type, Risk Level, and Dialogue Intent, enabling fine-grained, risk-aware analysis. Our results reveal several key findings: (i) simple baselines and advanced LLM APIs exhibit comparable overall accuracy yet diverge substantially on high-risk cases; (ii) some LLMs maintain consistent ordinal severity rankings while failing at absolute risk classification, whereas others achieve reasonable aggregate scores but suffer high false-negative rates on severe categories; and (iii) these performance gaps are amplified in multi-turn dialogues, where risk signals emerge gradually. These observations demonstrate that conventional benchmarks are insufficient for safety-critical mental health settings. By releasing MHDash as an open platform, we aim to promote reproducible research, transparent evaluation, and safety-aligned development of AI systems for mental health support.
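To make the annotation schema and the high-risk failure mode concrete, here is a minimal Python sketch of what one annotated dialogue turn might look like, together with a per-category false-negative rate of the kind the findings above emphasize. Everything here is an illustrative assumption for exposition: the names (RiskLevel, ConcernType, DialogueIntent, AnnotatedTurn), the label sets, and the four-level severity ordering are hypothetical, not MHDash's actual schema or API.

```python
# Illustrative sketch only: all enums, field names, and label sets below are
# assumptions for exposition, not MHDash's actual schema.
from dataclasses import dataclass
from enum import Enum, IntEnum


class RiskLevel(IntEnum):
    # IntEnum preserves an ordinal severity ranking, so levels can be
    # compared directly (e.g., pred >= RiskLevel.SEVERE for alerting).
    NONE = 0
    LOW = 1
    MODERATE = 2
    SEVERE = 3


class ConcernType(Enum):
    SUICIDAL_IDEATION = "suicidal_ideation"
    SELF_HARM = "self_harm"
    OTHER = "other"


class DialogueIntent(Enum):
    SEEKING_SUPPORT = "seeking_support"
    DISCLOSURE = "disclosure"
    INFORMATION = "information"


@dataclass
class AnnotatedTurn:
    """One annotated turn in a multi-turn dialogue."""
    turn_index: int
    text: str
    concern_type: ConcernType
    risk_level: RiskLevel
    dialogue_intent: DialogueIntent


def severe_false_negative_rate(gold, pred, threshold=RiskLevel.SEVERE):
    """Fraction of gold-severe turns that the model rated below threshold,
    i.e., the high-risk misses that aggregate accuracy can hide."""
    severe_cases = [(g, p) for g, p in zip(gold, pred) if g >= threshold]
    if not severe_cases:
        return 0.0
    return sum(p < threshold for _, p in severe_cases) / len(severe_cases)
```

Under this sketch, a model could score well on overall accuracy while `severe_false_negative_rate` stays high, which is exactly why risk-stratified metrics, rather than aggregate ones, are needed in this setting.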