Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, development, and deployment activities with minimal oversight. In response, frontier AI system evaluations have been proposed as a way of assessing risks from the development and deployment of frontier AI systems. Yet, the budding AI risk evaluation ecosystem faces significant coordination challenges, such as a limited diversity of evaluators, suboptimal allocation of effort, and perverse incentives. This paper proposes a solution in the form of an international consortium for AI risk evaluations, comprising both AI developers and third-party AI risk evaluators. Such a consortium could play a critical role in international efforts to mitigate societal-scale risks from advanced AI, including in managing responsible scaling policies and coordinated evaluation-based risk response. In this paper, we discuss the current evaluation ecosystem and its shortcomings, propose an international consortium for advanced AI risk evaluations, discuss issues regarding its implementation, discuss lessons that can be learnt from previous international institutions and existing proposals for international AI governance institutions, and, finally, we recommend concrete steps to advance the establishment of the proposed consortium: (i) solicit feedback from stakeholders, (ii) conduct additional research, (iii) conduct a workshop(s) for stakeholders, (iv) analyze feedback and create final proposal, (v) solicit funding, and (vi) create a consortium.
翻译:鉴于高级人工智能的快速进展及其前沿系统(突破人工智能能力边界的高级AI系统)带来的风险,人工智能治理与监管机制的设计与实施应获得优先关注和重大投入。然而,当前现状既不可持续,坦率而言更存在危险。监管真空使得人工智能实验室在开展研究、开发及部署活动时几乎不受监督。针对这一现状,前沿AI系统评估已被提议作为评估前沿AI系统开发与部署风险的手段。然而,初具规模的AI风险评估生态系统面临重大协作挑战,例如评估主体多样性不足、资源配置不优化以及激励扭曲。本文提出以国际AI风险评估联合体为解决方案,该联合体由AI开发者与第三方AI风险评估机构共同构成。此类联合体可在缓解高级人工智能引发的社会级风险的国际努力中发挥关键作用,包括管理负责任扩展政策及协调基于评估的风险应对机制。本文首先剖析当前评估生态系统的缺陷,继而提出构建高级人工智能风险评估国际联合体的方案,讨论其落实中的相关议题,借鉴国际组织先例及现有国际AI治理机构提案中的经验教训,最终提出推进该联合体建设的具体步骤:(i) 征求利益相关方反馈,(ii) 开展补充研究,(iii) 组织利益相关方工作坊,(iv) 整合反馈形成最终方案,(v) 筹措资金,(vi) 正式成立联合体。