Deontological ethics, specifically understood through Immanuel Kant, provides a moral framework that emphasizes the importance of duties and principles, rather than the consequences of action. Understanding that despite the prominence of deontology, it is currently an overlooked approach in fairness metrics, this paper explores the compatibility of a Kantian deontological framework in fairness metrics, part of the AI alignment field. We revisit Kant's critique of utilitarianism, which is the primary approach in AI fairness metrics and argue that fairness principles should align with the Kantian deontological framework. By integrating Kantian ethics into AI alignment, we not only bring in a widely-accepted prominent moral theory but also strive for a more morally grounded AI landscape that better balances outcomes and procedures in pursuit of fairness and justice.
翻译:义务论伦理学,特别是通过伊曼努尔·康德的理论框架,强调义务与原则的重要性,而非行为后果。尽管义务论具有突出地位,但在公平性度量领域,这一方法目前仍被忽视。本文探讨了康德义务论框架在公平性度量(属于人工智能对齐领域)中的兼容性。我们重新审视康德对功利主义的批判(后者是人工智能公平性度量的主要方法),并论证公平性原则应与康德义务论框架相契合。通过将康德伦理学融入人工智能对齐,我们不仅能引入广受认可的重要道德理论,还能致力于构建更具道德基础的人工智能图景,在追求公平与正义的过程中更好地平衡结果与程序。