Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models

As Text-to-Image (TTI) diffusion models become increasingly influential in content creation, growing attention is being directed toward their societal and cultural implications. While prior research has primarily examined demographic and cultural biases, the ability of these models to accurately represent historical contexts remains largely underexplored. To address this gap, we introduce a benchmark for evaluating how TTI models depict historical contexts. The benchmark combines HistVis, a dataset of 30,000 synthetic images generated by three state-of-the-art diffusion models from carefully designed prompts covering universal human activities across multiple historical periods, with a reproducible evaluation protocol. We evaluate generated imagery across three key aspects: (1) Implicit Stylistic Associations: examining default visual styles associated with specific eras; (2) Historical Consistency: identifying anachronisms such as modern artifacts in pre-modern contexts; and (3) Demographic Representation: comparing generated racial and gender distributions against historically plausible baselines. Our findings reveal systematic inaccuracies in historically themed generated imagery: TTI models frequently stereotype past eras by incorporating unstated stylistic cues, introduce anachronisms, and fail to reflect plausible demographic patterns. By providing a reproducible benchmark for historical representation in generated imagery, this work offers an initial step toward building more historically accurate TTI models.
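
As a rough illustration of the third aspect (Demographic Representation), the sketch below compares a distribution of predicted labels against a baseline distribution. It is not the paper's protocol: the abstract does not say which distance measure is used, so total variation distance is an assumption made here, and the function names, labels, and numbers are hypothetical placeholders rather than data or results from HistVis.

```python
from collections import Counter


def label_distribution(labels):
    """Turn a list of predicted labels into a {label: proportion} mapping."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


def total_variation_distance(generated, baseline):
    """Half the L1 distance between two distributions.

    Categories missing from one side are treated as having proportion 0.
    """
    categories = set(generated) | set(baseline)
    return 0.5 * sum(
        abs(generated.get(c, 0.0) - baseline.get(c, 0.0)) for c in categories
    )


# Hypothetical inputs for illustration only; not taken from the paper.
predicted_labels = ["man"] * 8 + ["woman"] * 2   # e.g. classifier output per image
baseline = {"man": 0.5, "woman": 0.5}            # assumed plausible baseline

generated = label_distribution(predicted_labels)
gap = total_variation_distance(generated, baseline)
print(f"Total variation distance: {gap:.2f}")
```

A value of 0 would mean the generated proportions match the baseline exactly, and 1 would mean the two distributions share no category. Any such comparison is only as reliable as the classifier that produced the labels and the baseline chosen for the period.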
