In this position paper, we argue that instead of morally aligning LLMs to a specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale. When provided with an ethical policy, an LLM should be capable of making decisions that are ethically consistent with that policy. We develop a framework that integrates moral dilemmas with moral principles pertaining to different formalisms of normative ethics and at different levels of abstraction. Initial experiments with GPT-x models show that while GPT-4 is a nearly perfect ethical reasoner, the models still have a bias towards the moral values of Western and English-speaking societies.