A computational framework of human values for ethical AI

In the diverse array of work investigating the nature of human values from psychology, philosophy and social sciences, there is a clear consensus that values guide behaviour. More recently, a recognition that values provide a means to engineer ethical AI has emerged. Indeed, Stuart Russell proposed shifting AI's focus away from simply ``intelligence'' towards intelligence ``provably aligned with human values''. This challenge -- the value alignment problem -- with others including an AI's learning of human values, aggregating individual values to groups, and designing computational mechanisms to reason over values, has energised a sustained research effort. Despite this, no formal, computational definition of values has yet been proposed. We address this through a formal conceptual framework rooted in the social sciences, that provides a foundation for the systematic, integrated and interdisciplinary investigation into how human values can support designing ethical AI.

翻译：在心理学、哲学和社会科学领域对人类价值观本质的多样化研究工作中，存在一个明确共识：价值观引导行为。近年来，人们逐渐认识到价值观为设计伦理人工智能提供了重要途径。事实上，斯图尔特·拉塞尔提出应将人工智能的研究重心从单纯的"智能"转向"与人类价值可证明一致"的智能。这一挑战——价值对齐问题——连同人工智能学习人类价值观、将个体价值观聚合为群体价值观、以及设计基于价值观推理的计算机制等问题，持续推动着相关研究。尽管如此，目前尚未提出正式的计算意义上的价值观定义。我们通过一个根植于社会科学的形式化概念框架来应对这一挑战，该框架为系统化、跨学科地研究人类价值观如何支持伦理人工智能设计提供了基础。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日