Two Types of AI Existential Risk: Decisive and Accumulative

The conventional discourse on existential risks (x-risks) from AI typically focuses on abrupt, dire events caused by advanced AI systems, particularly those that might achieve or surpass human-level intelligence. These events have severe consequences that either lead to human extinction or irreversibly cripple human civilization to a point beyond recovery. This discourse, however, often neglects the serious possibility of AI x-risks manifesting incrementally through a series of smaller yet interconnected disruptions, gradually crossing critical thresholds over time. This paper contrasts the conventional "decisive AI x-risk hypothesis" with an "accumulative AI x-risk hypothesis." While the former envisions an overt AI takeover pathway, characterized by scenarios like uncontrollable superintelligence, the latter suggests a different causal pathway to existential catastrophes. This involves a gradual accumulation of critical AI-induced threats such as severe vulnerabilities and systemic erosion of econopolitical structures. The accumulative hypothesis suggests a boiling frog scenario where incremental AI risks slowly converge, undermining resilience until a triggering event results in irreversible collapse. Through systems analysis, this paper examines the distinct assumptions differentiating these two hypotheses. It is then argued that the accumulative view reconciles seemingly incompatible perspectives on AI risks. The implications of differentiating between these causal pathways -- the decisive and the accumulative -- for the governance of AI risks as well as long-term AI safety are discussed.

翻译：关于人工智能（AI）引发存在风险（x-risks）的传统论述通常聚焦于先进AI系统（尤其是可能达到或超越人类智力水平的系统）所导致的突发性、灾难性事件。这些事件会引发严重后果，要么导致人类灭绝，要么对人类文明造成不可逆的摧毁，使其无法恢复。然而，这种论述往往忽视了另一类严重的可能性：AI存在风险通过一系列较小但相互关联的扰动逐步显现，并随时间推移逐渐跨越关键阈值。本文对比了传统的"突发性AI存在风险假说"与"累积性AI存在风险假说"。前者设想了一种显性的AI接管路径，典型场景包括不可控的超级智能；后者则揭示了导致存在灾难的不同因果路径，即关键性的AI诱发威胁（如严重脆弱性与经济政治结构的系统性侵蚀）逐步累积。累积性假说描绘了一种"温水煮青蛙"式的情景：递增的AI风险缓慢汇聚，不断削弱系统韧性，直至某个触发事件导致不可逆的崩溃。通过系统分析，本文检验了区分这两种假说的独特假设，进而论证累积性视角能够调和关于AI风险的看似矛盾的观点。文章最后探讨了区分这两种因果路径（突发性与累积性）对AI风险治理及长期AI安全性研究的启示。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日