Bridging the Gap in the Responsible AI Divides

Tensions between AI Safety (AIS) and AI Ethics (AIE) have increasingly surfaced in AI governance and public debates about AI, leading to what we term the "responsible AI divides". We introduce a model that categorizes four modes of engagement with the tensions: radical confrontation, disengagement, compartmentalized coexistence, and critical bridging. We then investigate how critical bridging, with a particular focus on bridging problems, offers one of the most viable constructive paths for advancing responsible AI. Using computational tools to analyze a curated dataset of 3,550 papers, we map the research landscapes of AIE and AIS to identify both distinct and overlapping problems. Our findings point to both thematic divides and overlaps. For example, we find that AIE has long grappled with overcoming injustice and tangible AI harms, whereas AIS has primarily embodied an anticipatory approach focused on the mitigation of risks from AI capabilities. At the same time, we find significant overlap in core research concerns across both AIE and AIS around transparency, reproducibility, and inadequate governance mechanisms. As AIE and AIS continue to evolve, we recommend focusing on bridging problems as a constructive path forward for enhancing collaborative AI governance. We offer a series of recommendations to integrate shared considerations into a collaborative approach to responsible AI. Alongside our proposal, we highlight its limitations and explore open problems for future research. All data including the fully annotated dataset of papers with code to reproduce our figures can be found at: https://github.com/gyevnarb/ai-safety-ethics.

翻译：人工智能安全（AIS）与人工智能伦理（AIE）之间的张力在人工智能治理和关于人工智能的公共辩论中日益凸显，形成了我们所谓的“负责任人工智能分歧”。我们引入一个模型，将应对这些张力的方式分为四类：激进对抗、脱离接触、区隔共存与批判性弥合。随后，我们探讨了批判性弥合——尤其侧重于弥合性问题——如何为推进负责任人工智能提供最具可行性的建设性路径之一。通过使用计算工具分析一个包含3550篇论文的精选数据集，我们绘制了AIE和AIS的研究版图，以识别其各自独特及相互重叠的问题。我们的研究结果揭示了主题上的分歧与重叠。例如，我们发现AIE长期致力于克服不公正和具体的人工智能危害，而AIS则主要体现为一种前瞻性方法，专注于缓解人工智能能力带来的风险。与此同时，我们发现AIE和AIS在透明度、可复现性以及治理机制不足等核心研究关切上存在显著重叠。随着AIE和AIS的持续发展，我们建议将重点放在弥合性问题上，将其作为加强协作式人工智能治理的建设性前进方向。我们提出一系列建议，旨在将共同的考量整合到负责任人工智能的协作方法中。在提出建议的同时，我们也指出了其局限性，并探讨了未来研究的开放性问题。所有数据，包括完整标注的论文数据集及用于复现我们图表的代码，均可在以下网址找到：https://github.com/gyevnarb/ai-safety-ethics。