For humanity to maintain and expand its agency into the future, the most powerful systems we create must be those which act to align the future with the will of humanity. The most powerful systems today are massive institutions like governments, firms, and NGOs. Deliberative technology is already being used across these institutions to help align governance and diplomacy with human will, and modern AI is poised to make this technology significantly better. At the same time, the race to superhuman AGI is already underway, and the AI systems it gives rise to may become the most powerful systems of the future. Failure to align the impact of such powerful AI with the will of humanity may lead to catastrophic consequences, while success may unleash abundance. Right now, there is a window of opportunity to use deliberative technology to align the impact of powerful AI with the will of humanity. Moreover, it may be possible to engineer a symbiotic coupling between powerful AI and deliberative alignment systems such that the quality of alignment improves as AI capabilities increase.