Contrary to Google Search's mission of delivering information from "many angles so you can form your own understanding of the world," we find that Google and its most prominent returned results - Wikipedia and YouTube - simply reflect a narrow set of cultural stereotypes tied to the search language for complex topics like "Buddhism," "Liberalism," "colonization," "Iran" and "America." Simply stated, they present, to varying degrees, distinct information across the same search in different languages, a phenomenon we call 'language bias.' This paper presents evidence and analysis of language bias and discusses its larger social implications. Instead of presenting a global picture of a complex topic, our online searches and emerging tools like ChatGPT turn us into the proverbial blind person touching a small portion of an elephant, ignorant of the existence of other cultural perspectives. Piecing together a genuine depiction of the elephant is a challenging and important endeavor that will require collaborative efforts from scholars in both the humanities and technology.
翻译:与谷歌搜索“从多角度提供信息,助您形成对世界的独立理解”的使命相反,我们发现谷歌及其最突出的返回结果——维基百科和YouTube——仅仅反映了一套与搜索语言紧密相关的狭隘文化刻板印象,涉及诸如“佛教”、“自由主义”、“殖民化”、“伊朗”和“美国”等复杂话题。简言之,它们对同一搜索词在不同语言下呈现出不同程度的差异化信息,我们将这一现象称为“语言偏见”。本文提供了关于语言偏见的证据与分析,并探讨其更广泛的社会影响。我们的在线搜索及ChatGPT等新兴工具并未呈现复杂话题的全球全貌,反而使我们如同寓言中仅触摸大象一小部分的盲人,对其他文化视角的存在一无所知。拼凑出大象的真实描绘是一项艰巨而重要的工作,需要人文学科与技术领域学者的协同努力。