Generative AI models continue to become more powerful. The launch of ChatGPT in November 2022 has ushered in a new era of AI. ChatGPT and other similar chatbots have a range of capabilities, from answering student homework questions to creating music and art. There are already concerns that humans may be replaced by chatbots for a variety of jobs. Because of the wide spectrum of data chatbots are built on, we know that they will have human errors and human biases built into them. These biases may cause significant harm and/or inequity toward different subpopulations. To understand the strengths and weakness of chatbot responses, we present a position paper that explores different use cases of ChatGPT to determine the types of questions that are answered fairly and the types that still need improvement. We find that ChatGPT is a fair search engine for the tasks we tested; however, it has biases on both text generation and code generation. We find that ChatGPT is very sensitive to changes in the prompt, where small changes lead to different levels of fairness. This suggests that we need to immediately implement "corrections" or mitigation strategies in order to improve fairness of these systems. We suggest different strategies to improve chatbots and also advocate for an impartial review panel that has access to the model parameters to measure the levels of different types of biases and then recommends safeguards that move toward responses that are less discriminatory and more accurate.
翻译:生成式人工智能模型正持续变得更加强大。2022年11月ChatGPT的发布开启了人工智能的新纪元。ChatGPT及其他类似聊天机器人具备从回答学生作业问题到创作音乐和艺术等一系列能力。人们已开始担忧聊天机器人可能取代人类从事多种工作。由于构建聊天机器人的数据涵盖广泛范畴,我们知道其中会内嵌人类的错误和偏见。这些偏见可能对不同群体造成严重伤害和/或不公平。为理解聊天机器人回答的优势与不足,我们提出一篇立场论文,探索ChatGPT的不同用例,以判定哪些类型的问题能得到公正回答,哪些类型仍需改进。我们发现在所测试的任务中,ChatGPT是一个公平的搜索引擎;然而,它在文本生成和代码生成方面均存在偏见。我们还发现ChatGPT对提示的变化极为敏感,微小的改动会导致不同级别的公平性。这表明我们亟需实施"修正"或缓解策略,以提升这些系统的公平性。我们提出了改进聊天机器人的不同策略,并倡导建立一个公正的审查小组,该小组可访问模型参数以衡量不同类型偏见的程度,进而推荐防护措施,推动回答更加去歧视、更准确。