Guidance on how to validate computational text-based measures of social constructs is fragmented. Although researchers generally acknowledge the importance of validating text-based measures, they often lack a shared vocabulary and a unified framework for doing so. This paper introduces ValiText, a new validation framework designed to assist scholars in validly measuring social constructs in textual data. The framework builds on a conceptual foundation of validity in the social sciences, strengthened by an empirical review of validation practices in the field and by consultations with experts. ValiText requires researchers to demonstrate three types of validation evidence: substantive evidence (outlining the theoretical underpinning of the measure), structural evidence (examining the properties of the text model and its output), and external evidence (testing how the measure relates to independent information). The framework is supplemented by a checklist of validation steps, which offers practical guidance in the form of documentation sheets that lead researchers through the validation process.