Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilities can arise unexpectedly; it is difficult to robustly prevent a deployed model from being misused; and it is difficult to stop a model's capabilities from proliferating broadly. To address these challenges, at least three building blocks for the regulation of frontier models are needed: (1) standard-setting processes to identify appropriate requirements for frontier AI developers, (2) registration and reporting requirements to provide regulators with visibility into frontier AI development processes, and (3) mechanisms to ensure compliance with safety standards for the development and deployment of frontier AI models. Industry self-regulation is an important first step. However, wider societal discussions and government intervention will be needed to create standards and to ensure compliance with them. We consider several options to this end, including granting enforcement powers to supervisory authorities and establishing licensure regimes for frontier AI models. Finally, we propose an initial set of safety standards. These include conducting pre-deployment risk assessments; subjecting model behavior to external scrutiny; using risk assessments to inform deployment decisions; and monitoring and responding to new information about model capabilities and uses post-deployment. We hope this discussion contributes to the broader conversation on how to balance public safety risks and innovation benefits from advances at the frontier of AI development.