We propose establishing an office to oversee AI systems by introducing a tiered system of explainability and benchmarking requirements for commercial AI systems. We examine how complex, high-risk technologies have been successfully regulated at the national level. Specifically, we draw parallels to the existing regulation of the U.S. medical device and pharmaceutical industries (regulated by the FDA), the proposed AI legislation in the European Union (the AI Act), and existing U.S. anti-discrimination legislation. To promote accountability and user trust, AI accountability mechanisms shall introduce standardized measures for each category of intended high-risk use of AI systems to enable structured comparisons among such systems. We suggest using explainable AI techniques, such as input influence measures, alongside fairness statistics and other performance measures of high-risk AI systems. We propose standardizing internal benchmarking and automated audits to transparently characterize high-risk AI systems. The results of such audits and benchmarks shall be clearly and transparently communicated and explained via a public AI registry to enable meaningful comparisons of competing AI systems. Such standardized audits, benchmarks, and certificates shall be specific to the intended high-risk use of the respective AI systems and could constitute conformity assessments for AI systems, e.g., under the European Union's AI Act.
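To make the notion of a "fairness statistic" in a standardized audit concrete, the sketch below computes one common example, the demographic parity difference: the gap in positive prediction rates between two demographic groups. This is an illustrative assumption on our part, not a metric mandated by the proposal; the function names, threshold idea, and data are hypothetical.

```python
# Hypothetical sketch of one fairness statistic a standardized audit
# might report: demographic parity difference, i.e., the gap in
# positive prediction rates between two groups. Data is illustrative.

def positive_rate(predictions):
    """Fraction of positive (1) predictions in a group."""
    return sum(predictions) / len(predictions)

def demographic_parity_difference(preds_group_a, preds_group_b):
    """Absolute gap in positive prediction rates between two groups.

    A value near 0 suggests similar treatment across groups; a public
    benchmark could flag systems whose gap exceeds a regulator-defined
    threshold for their intended high-risk use.
    """
    return abs(positive_rate(preds_group_a) - positive_rate(preds_group_b))

# Example: a high-risk model's binary decisions for two groups.
group_a = [1, 1, 0, 1, 0, 1, 1, 0]  # 5/8 = 0.625 positive rate
group_b = [1, 0, 0, 1, 0, 0, 1, 0]  # 3/8 = 0.375 positive rate
gap = demographic_parity_difference(group_a, group_b)
print(f"Demographic parity difference: {gap:.3f}")  # 0.250
```

Reporting such a statistic per intended use category, alongside input influence measures and accuracy figures, is the kind of structured, comparable entry a public AI registry could hold.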