An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

Ross Gruetzemacher,Alan Chan,Kevin Frazier,Christy Manning,Štěpán Los,James Fox,José Hernández-Orallo,John Burden,Matija Franklin,Clíodhna Ní Ghuidhir,Mark Bailey,Daniel Eth,Toby Pilditch,Kyle Kilian

from arxiv, 50 pages, 2 figures; updated w/ a few minor revisions based on feedback from SoLaR Workshop reviewers (on 5 page version)

Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, development, and deployment activities with minimal oversight. In response, frontier AI system evaluations have been proposed as a way of assessing risks from the development and deployment of frontier AI systems. Yet, the budding AI risk evaluation ecosystem faces significant coordination challenges, such as a limited diversity of evaluators, suboptimal allocation of effort, and perverse incentives. This paper proposes a solution in the form of an international consortium for AI risk evaluations, comprising both AI developers and third-party AI risk evaluators. Such a consortium could play a critical role in international efforts to mitigate societal-scale risks from advanced AI, including in managing responsible scaling policies and coordinated evaluation-based risk response. In this paper, we discuss the current evaluation ecosystem and its shortcomings, propose an international consortium for advanced AI risk evaluations, discuss issues regarding its implementation, discuss lessons that can be learnt from previous international institutions and existing proposals for international AI governance institutions, and, finally, we recommend concrete steps to advance the establishment of the proposed consortium: (i) solicit feedback from stakeholders, (ii) conduct additional research, (iii) conduct a workshop(s) for stakeholders, (iv) analyze feedback and create final proposal, (v) solicit funding, and (vi) create a consortium.

翻译：鉴于高级人工智能的快速进展及其前沿系统（突破人工智能能力边界的高级AI系统）带来的风险，人工智能治理与监管机制的设计与实施应获得优先关注和重大投入。然而，当前现状既不可持续，坦率而言更存在危险。监管真空使得人工智能实验室在开展研究、开发及部署活动时几乎不受监督。针对这一现状，前沿AI系统评估已被提议作为评估前沿AI系统开发与部署风险的手段。然而，初具规模的AI风险评估生态系统面临重大协作挑战，例如评估主体多样性不足、资源配置不优化以及激励扭曲。本文提出以国际AI风险评估联合体为解决方案，该联合体由AI开发者与第三方AI风险评估机构共同构成。此类联合体可在缓解高级人工智能引发的社会级风险的国际努力中发挥关键作用，包括管理负责任扩展政策及协调基于评估的风险应对机制。本文首先剖析当前评估生态系统的缺陷，继而提出构建高级人工智能风险评估国际联合体的方案，讨论其落实中的相关议题，借鉴国际组织先例及现有国际AI治理机构提案中的经验教训，最终提出推进该联合体建设的具体步骤：(i) 征求利益相关方反馈，(ii) 开展补充研究，(iii) 组织利益相关方工作坊，(iv) 整合反馈形成最终方案，(v) 筹措资金，(vi) 正式成立联合体。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日