In this multicultural age, language translation is one of the most performed tasks, and it is becoming increasingly AI-moderated and automated. As a novel AI system, ChatGPT claims to be proficient in such translation tasks and in this paper, we put that claim to the test. Specifically, we examine ChatGPT's accuracy in translating between English and languages that exclusively use gender-neutral pronouns. We center this study around Bengali, the 7$^{th}$ most spoken language globally, but also generalize our findings across five other languages: Farsi, Malay, Tagalog, Thai, and Turkish. We find that ChatGPT perpetuates gender defaults and stereotypes assigned to certain occupations (e.g. man = doctor, woman = nurse) or actions (e.g. woman = cook, man = go to work), as it converts gender-neutral pronouns in languages to `he' or `she'. We also observe ChatGPT completely failing to translate the English gender-neutral pronoun `they' into equivalent gender-neutral pronouns in other languages, as it produces translations that are incoherent and incorrect. While it does respect and provide appropriately gender-marked versions of Bengali words when prompted with gender information in English, ChatGPT appears to confer a higher respect to men than to women in the same occupation. We conclude that ChatGPT exhibits the same gender biases which have been demonstrated for tools like Google Translate or MS Translator, as we provide recommendations for a human centered approach for future designers of AIs that perform language translation to better accommodate such low-resource languages.
翻译:在这个多元文化时代,语言翻译是最常执行的任务之一,且日益受到人工智能管理和自动化。作为新型人工智能系统,ChatGPT 声称精通此类翻译任务,而本文对其这一主张进行了检验。具体而言,我们考察了 ChatGPT 在英语与仅使用中性性别代词的语言之间进行翻译的准确性。本研究以全球第七大使用语言孟加拉语为中心,同时将发现推广至其他五种语言:波斯语、马来语、他加禄语、泰语和土耳其语。我们发现,ChatGPT 在将中性性别代词转换为“他”或“她”时,延续了针对特定职业(如男性=医生,女性=护士)或行为(如女性=做饭,男性=上班)的性别默认设定与刻板印象。我们还观察到,ChatGPT 完全无法将英语中性性别代词“they”翻译成其他语言中对应的中性性别代词,从而产生不连贯且错误的译文。尽管在英语提示中提供性别信息时,ChatGPT 会对孟加拉语词汇给出尊重且适当标注性别的版本,但它在同一职业中似乎给予男性比女性更高的尊重。我们得出结论,ChatGPT 表现出与 Google Translate 或 MS Translator 等工具相同的性别偏见,并为此类执行语言翻译任务的人工智能未来设计者提供建议,即采用以人为中心的方法以更好地适应这些低资源语言。