Guidance on how to validate computational text-based measures of social constructs is fragmented. While researchers generally acknowledge the importance of validating text-based measures, they often lack a shared vocabulary and a unified framework to do so. This paper introduces ValiText, a new validation framework designed to assist scholars in validly measuring social constructs in textual data. The framework is built on a conceptual foundation of validity in the social sciences, strengthened by an empirical review of validation practices in the social sciences and consultations with experts. Ultimately, ValiText prescribes researchers to demonstrate three types of validation evidence: substantive evidence (outlining the theoretical underpinning of the measure), structural evidence (examining the properties of the text model and its output) and external evidence (testing for how the measure relates to independent information). The framework is further supplemented by a checklist of validation steps, offering practical guidance in the form of documentation sheets that guide researchers in the validation process.
翻译:关于如何验证基于文本的社会构念计算测量方法,现有指导较为零散。尽管研究者普遍认同验证文本测量方法的重要性,但往往缺乏统一的术语体系和验证框架。本文提出ValiText这一新型验证框架,旨在帮助学者有效测量文本数据中的社会构念。该框架以社会科学效度概念为基础,通过系统梳理社会科学领域的验证实践并咨询领域专家而构建。ValiTex最终要求研究者提供三类效度证据:实质性证据(阐明测量的理论基础)、结构证据(检验文本模型及其输出的特性)以及外部证据(检验测量与独立信息的关系)。该框架还附有验证步骤清单,通过提供标准化文档模板的形式,为研究者的验证过程提供实践指导。