On Hate Scaling Laws For Data-Swamps

`Scale the model, scale the data, scale the GPU-farms' is the reigning sentiment in the world of generative AI today. While model scaling has been extensively studied, data scaling and its downstream impacts remain underexplored. This is especially critical in the context of visio-linguistic datasets whose main source is the World Wide Web, condensed and packaged as the CommonCrawl dump. This large-scale data dump, which is known to have numerous drawbacks, is repeatedly mined and serves as the data-motherlode for large generative models. In this paper, we: 1) investigate the effect of scaling datasets on hateful content through a comparative audit of the LAION-400M and LAION-2B-en datasets, containing 400 million and 2 billion samples respectively, and 2) evaluate the downstream impact of scale on visio-linguistic models trained on these dataset variants by measuring their racial bias, using the Chicago Face Dataset (CFD) as a probe. Our results show that 1) the presence of hateful content in the datasets, when measured with a Hate Content Rate (HCR) metric on the inferences of the Pysentimiento hate-detection Natural Language Processing (NLP) model, increased by nearly $12\%$ with scale, and 2) societal biases and negative stereotypes were also exacerbated with scale in the models we evaluated. As scale increased, the tendency of the model to associate images of human faces with the `human being' class over seven other offensive classes fell by half. Furthermore, the tendency of the model to associate Black female faces with the `criminal' class doubled, while for Black male faces it quintupled. We present a qualitative and historical analysis of the model audit results, reflect on our findings and their implications for dataset curation practice, and close with a summary of our findings and potential future work in this area.
