Bias in textual data can lead to skewed interpretations and outcomes when the data is used. These biases could perpetuate stereotypes, discrimination, or other forms of unfair treatment. An algorithm trained on biased data ends up making decisions that disproportionately impact a certain group of people. Therefore, it is crucial to detect and remove these biases to ensure the fair and ethical use of data. To this end, we develop a comprehensive and robust framework \textsc{Nbias} that consists of a data layer, corpus contruction, model development layer and an evaluation layer. The dataset is constructed by collecting diverse data from various fields, including social media, healthcare, and job hiring portals. As such, we applied a transformer-based token classification model that is able to identify bias words/ phrases through a unique named entity. In the assessment procedure, we incorporate a blend of quantitative and qualitative evaluations to gauge the effectiveness of our models. We achieve accuracy improvements ranging from 1% to 8% compared to baselines. We are also able to generate a robust understanding of the model functioning, capturing not only numerical data but also the quality and intricacies of its performance. The proposed approach is applicable to a variety of biases and contributes to the fair and ethical use of textual data.
翻译:文本数据中的偏见可能导致在使用数据时产生扭曲的解释和结果。这些偏见可能固化刻板印象、歧视或其他形式的不公平对待。基于有偏见数据训练的算法最终会做出对特定群体产生不成比例影响的决策。因此,检测并消除这些偏见对于确保数据的公平和伦理使用至关重要。为此,我们开发了一个全面且稳健的框架\textsc{Nbias},它包含数据层、语料库构建、模型开发层和评估层。数据集通过收集来自社交媒体、医疗保健和招聘门户等多个领域的多样化数据构建而成。我们应用了一种基于Transformer的词元分类模型,该模型能够通过独特的命名实体识别偏见词汇/短语。在评估过程中,我们整合了定量与定性评估方法以衡量模型的有效性。相比基线方法,我们的准确率提升了1%至8%。同时,我们能够生成对模型功能的稳健理解,不仅捕捉数值数据,还涵盖其性能的质量与复杂性。所提出的方法适用于多种偏见类型,有助于文本数据的公平与伦理使用。