The European Union's Artificial Intelligence Act establishes comprehensive requirements for high-risk AI systems, yet the harmonized standards necessary for demonstrating compliance are not yet fully developed. In this paper, we investigate the practical application of the Fraunhofer AI assessment catalogue as a certification framework through a complete self-certification cycle of an AI-based facial emotion recognition system. Starting from a baseline model with documented deficiencies, including inadequate demographic representation and high prediction uncertainty, we trace an enhancement process guided by AI certification requirements. The enhanced system achieves higher accuracy, improved reliability metrics, and comprehensive fairness across demographic groups. We focus our assessment on two of the six Fraunhofer catalogue dimensions, reliability and fairness; the enhanced system satisfies the certification criteria for both. We find that the certification framework provides value as a proactive development tool, driving concrete technical improvements and generating documentation naturally through integration into the development process. However, fundamental gaps separate structured self-certification from legal compliance: harmonized European standards are not yet fully available, and AI assessment frameworks and catalogues cannot substitute for them on their own. These findings establish the Fraunhofer AI assessment catalogue as a valuable preparatory tool that currently complements rather than replaces formal compliance requirements.