Aligning Human-AI-Interaction Trust for Mental Health Support: Survey and Position for Multi-Stakeholders

Building trustworthy AI systems for mental health support is a shared priority across stakeholders from multiple disciplines. However, "trustworthy" remains loosely defined and inconsistently operationalized. AI research often focuses on technical criteria (e.g., robustness, explainability, and safety), while therapeutic practitioners emphasize therapeutic fidelity (e.g., appropriateness, empathy, and long-term user outcomes). To bridge this fragmented landscape, we propose a three-layer trust framework covering human-oriented, AI-oriented, and interaction-oriented trust, which integrates the viewpoints of key stakeholders (e.g., practitioners, researchers, and regulators). Using this framework, we systematically review existing AI-driven research in the mental health domain and examine how trustworthiness is evaluated, from automatic metrics to clinically validated approaches. We highlight critical gaps between what NLP currently measures and what real-world mental health contexts require, and outline a research agenda for building socio-technically aligned and genuinely trustworthy AI for mental health support.


