Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, development, and deployment activities with minimal oversight. In response, frontier AI system evaluations have been proposed as a way of assessing risks from the development and deployment of frontier AI systems. Yet, the budding AI risk evaluation ecosystem faces significant coordination challenges, such as a limited diversity of evaluators, suboptimal allocation of effort, and perverse incentives. This paper proposes a solution in the form of an international consortium for AI risk evaluations, comprising both AI developers and third-party AI risk evaluators. Such a consortium could play a critical role in international efforts to mitigate societal-scale risks from advanced AI, including in managing responsible scaling policies and coordinated evaluation-based risk response. In this paper, we discuss the current evaluation ecosystem and its shortcomings, propose an international consortium for advanced AI risk evaluations, discuss issues regarding its implementation, discuss lessons that can be learnt from previous international institutions and existing proposals for international AI governance institutions, and, finally, we recommend concrete steps to advance the establishment of the proposed consortium: (i) solicit feedback from stakeholders, (ii) conduct additional research, (iii) conduct a workshop(s) for stakeholders, (iv) analyze feedback and create final proposal, (v) solicit funding, and (vi) create a consortium.
翻译:鉴于高级AI的快速发展及前沿AI系统(即不断突破AI能力边界的高级AI系统)所带来的风险,AI治理与监管机制的创建和实施亟需优先考虑并投入大量资源。然而,现状难以为继,坦言之,危机四伏。监管真空使得AI实验室在研发、部署等活动中几乎不受约束。为此,前沿AI系统评估作为一种衡量前沿AI系统开发与部署风险的机制被提出。然而,新兴的AI风险评估体系面临显著的协调难题,例如评估主体多样性不足、资源配置效率低下以及激励扭曲等问题。本文提出解决方案:组建一个由AI开发方与第三方AI风险评估方共同参与的国际联合体。该联合体可在缓解高级AI社会层面风险的全球行动中发挥关键作用,包括实施负责任规模扩展策略与协调一致的评估风险响应。本文首先剖析现有评估体系及其缺陷,继而提出高级AI风险评估国际联合体方案,并探讨其落地实施的相关议题;随后回顾既往国际机构治理实践及现有AI国际治理提案的可借鉴经验;最终提出推动该联合体成立的具体行动建议:(一)征求利益相关方意见,(二)开展补充研究,(三)举办利益相关方研讨会,(四)分析反馈并形成最终方案,(五)筹措资金,(六)正式成立联合体。