Members of the LGBTQ+ community face disproportionate mental health challenges, including higher rates of depression, anxiety, and suicidal ideation. Research has shown that LGBTQ+ people have been using large language model-based chatbots, such as ChatGPT, for their mental health needs. Despite the immediate support and anonymity these chatbots offer, concerns remain about their capacity to provide empathetic, accurate, and affirming responses. In response to these challenges, we propose a framework for evaluating the affirmativeness of LLMs grounded in the principles of affirmative therapy, emphasizing the attitudes, knowledge, and actions needed to support and validate LGBTQ+ experiences. We combine qualitative and quantitative analyses to establish benchmarks for "Affirmative AI," with the goal of ensuring that LLM-based chatbots can provide safe, supportive, and effective mental health support to LGBTQ+ individuals. We benchmark LLM affirmativeness not to position LLMs as a mental health solution for LGBTQ+ individuals or to claim they resolve the community's mental health issues; rather, we highlight the need to account for the complex discrimination LGBTQ+ people experience when designing technological aids. Because many in the community already use these systems, our goal is to evaluate LLMs for LGBTQ+ mental health support and to identify the potential harms of using general-purpose LLMs in this context.