Case Repositories: Towards Case-Based Reasoning for AI Alignment

Case studies commonly form the pedagogical backbone in law, ethics, and many other domains that face complex and ambiguous societal questions informed by human values. Similar complexities and ambiguities arise when we consider how AI should be aligned in practice: when faced with vast quantities of diverse (and sometimes conflicting) values from different individuals and communities, with whose values is AI to align, and how should AI do so? We propose a complementary approach to constitutional AI alignment, grounded in ideas from case-based reasoning (CBR), that focuses on the construction of policies through judgments on a set of cases. We present a process to assemble such a case repository by: 1) gathering a set of "seed" cases (questions one may ask an AI system) in a particular domain, 2) eliciting domain-specific key dimensions for cases through workshops with domain experts, 3) using LLMs to generate variations of cases not seen in the wild, and 4) engaging with the public to judge and improve cases. We then discuss how such a case repository could assist in AI alignment, both through directly acting as precedents to ground acceptable behaviors, and as a medium for individuals and communities to engage in moral reasoning around AI.
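As a rough illustration of the four-step process described above, the following minimal Python sketch shows one way a case repository could be represented. It is not the authors' implementation: the `Case` class, the `expand_seed` and `record_judgment` helpers, the example domain, and the dimension names are all hypothetical, and plain string templating stands in for the LLM-based variation step so that the sketch runs offline.

```python
from dataclasses import dataclass, field
from itertools import product


@dataclass
class Case:
    """A question someone might ask an AI system, plus judgments collected on it."""
    text: str
    dimensions: dict[str, str] = field(default_factory=dict)
    judgments: list[str] = field(default_factory=list)


def expand_seed(template: str, dimensions: dict[str, list[str]]) -> list[Case]:
    """Create one case per combination of dimension values (stand-in for step 3).

    Assumption: the abstract uses LLMs to write variations; string templating
    is substituted here only to keep the example self-contained.
    """
    names = list(dimensions)
    cases = []
    for values in product(*(dimensions[name] for name in names)):
        assignment = dict(zip(names, values))
        cases.append(Case(text=template.format(**assignment), dimensions=assignment))
    return cases


def record_judgment(case: Case, judgment: str) -> None:
    """Attach one public judgment to a case (step 4)."""
    case.judgments.append(judgment)


# Step 1: a hypothetical seed case, written as a template.
seed = "As a {role}, I have a {urgency} question about a medication dose."
# Step 2: hypothetical key dimensions, as experts might propose them.
dims = {"role": ["patient", "caregiver"], "urgency": ["routine", "urgent"]}

repository = expand_seed(seed, dims)
record_judgment(repository[0], "give general information and suggest asking a pharmacist")

print(len(repository))     # 4
print(repository[0].text)  # As a patient, I have a routine question about a medication dose.
```

Keeping the dimension values on each case, rather than only the generated text, would let later judgments be compared across cases that differ in a single dimension, which is one plausible way for such a repository to act as a set of precedents.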

