Model and Evaluation: Towards Fairness in Multilingual Text Classification

Recently, more and more research has focused on addressing bias in text classification models. However, existing research mainly focuses on the fairness of monolingual text classification models, and research on fairness for multilingual text classification is still very limited. In this paper, we focus on the task of multilingual text classification and propose a debiasing framework for multilingual text classification based on contrastive learning. Our proposed method does not rely on any external language resources and can be extended to any other languages. The model contains four modules: multilingual text representation module, language fusion module, text debiasing module, and text classification module. The multilingual text representation module uses a multilingual pre-trained language model to represent the text, the language fusion module makes the semantic spaces of different languages tend to be consistent through contrastive learning, and the text debiasing module uses contrastive learning to make the model unable to identify sensitive attributes' information. The text classification module completes the basic tasks of multilingual text classification. In addition, the existing research on the fairness of multilingual text classification is relatively simple in the evaluation mode. The evaluation method of fairness is the same as the monolingual equality difference evaluation method, that is, the evaluation is performed on a single language. We propose a multi-dimensional fairness evaluation framework for multilingual text classification, which evaluates the model's monolingual equality difference, multilingual equality difference, multilingual equality performance difference, and destructiveness of the fairness strategy. We hope that our work can provide a more general debiasing method and a more comprehensive evaluation framework for multilingual text fairness tasks.

翻译：近年来，越来越多的研究聚焦于解决文本分类模型中的偏差问题。然而，现有研究主要关注单语言文本分类模型的公平性，针对多语言文本分类公平性的研究仍十分有限。本文聚焦多语言文本分类任务，提出了一种基于对比学习的多语言文本分类去偏框架。所提出的方法不依赖任何外部语言资源，可扩展至其他语言。该模型包含四个模块：多语言文本表示模块、语言融合模块、文本去偏模块和文本分类模块。多语言文本表示模块使用多语言预训练语言模型表示文本，语言融合模块通过对比学习使不同语言的语义空间趋于一致，文本去偏模块利用对比学习使模型无法识别敏感属性信息，文本分类模块完成多语言文本分类的基础任务。此外，现有关于多语言文本分类公平性的研究在评估模式上较为单一，其公平性评估方法与单语言平等差异评估方法相同，即仅在单一语言上进行评估。本文提出了一种面向多语言文本分类的多维公平性评估框架，评估模型的单语言平等差异、多语言平等差异、多语言平等性能差异以及公平策略的破坏性。我们期望本研究能为多语言文本公平性任务提供更通用的去偏方法和更全面的评估框架。