Minimal Oversight: Uncertainty-Aware Governance for Delegated AI Systems

from arxiv, Companion Python package: pip install minimal-oversight | Code: https://github.com/crbazevedo/delegation-lab | 26 pages, 1 figure, 5 tables

AI systems increasingly delegate decisions to specialized models, evaluators, tools, and supervisory controllers. The central AI problem is no longer only model accuracy, but uncertainty-aware governance: how much autonomy to grant, which evidence should calibrate trust, what performance ceiling a delegated AI system can sustain, and when human intervention becomes necessary. We propose the Minimum Sufficient Oversight Principle (MSO), a variational principle for principled autonomy delegation: minimize governance burden on the Fisher information manifold subject to a delivery constraint. The resulting Euler-Lagrange solution yields a water-filling allocation of governed delegation across the task space. Building on a revealed-action governed delegation channel model, we prove a capacity theorem for stationary symbolwise review policies, derive a local first-order approximation relating workflow complexity to quality degradation, and give a drift-dominated autonomy-time scaling law linking intervention timing to effective capacity, complexity, and drift. Within this framework, masking appears as a structural AI-governance pathology: corrected performance can hide the competence signal needed to calibrate trust. Synthetic simulations and a semi-real reconstructed workflow support design prescriptions including upstream-first correction, sensitivity-based intervention, and explicit feasibility checks before autonomy is expanded. The result is a computable framework for uncertainty, planning, and oversight in delegated AI systems. A companion Python package is available at https://github.com/crbazevedo/delegation-lab.

翻译：人工智能系统日益将决策委托给专门模型、评估器、工具和监督控制器。核心AI问题已不再仅是模型精度，而是不确定性感知治理：应授予多少自主权、哪些证据可用于校准信任、委托式AI系统可维持的性能上限，以及何时需要人类干预。我们提出最小充分监督原则（MSO），这是一种用于原则性自主委托的变分原理：在满足交付约束条件下，最小化Fisher信息流形上的治理负担。由此得到的欧拉-拉格朗日解给出了任务空间上的注水式委托治理分配方案。基于显性行为委托治理信道模型，我们证明了平稳符号级审查策略的容量定理，推导出关联工作流复杂度与质量退化的一阶局部近似关系，并建立了漂移主导的自主权-时间标度律，将干预时机与有效容量、复杂度和漂移联系起来。在该框架内，遮掩表现为一种结构性AI治理病态：修正后的性能可能掩盖校准信任所需的能力信号。综合仿真与半真实重构工作流支持如下设计准则：优先上游修正、基于灵敏度的干预，以及在扩展自主权前进行显式可行性检验。研究结果为委托式AI系统中的不确定性、规划与监督提供了可计算框架。配套Python工具包发布于https://github.com/crbazevedo/delegation-lab。

相关内容

关注 7111

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【博士论文】已对齐人工智能系统的持久脆弱性

专知会员服务

12+阅读 · 4月15日

【博士论文】已对齐 AI 系统的持续脆弱性

专知会员服务

14+阅读 · 4月3日

人工智能治理的未来

专知会员服务

31+阅读 · 2025年8月3日

【博士论文】迈向负责任的人工智能：自主系统在安全性、公平性与可问责性方面的最新进展

专知会员服务

20+阅读 · 2025年6月15日