Data reuse is a common practice in the social sciences. While published data play an essential role in the production of social science research, they are not consistently cited, which makes it difficult to assess their full scholarly impact and give credit to the original data producers. Furthermore, it can be challenging to understand researchers' motivations for referencing data. Like references to academic literature, data references perform various rhetorical functions, such as paying homage, signaling disagreement, or drawing comparisons. This paper studies how and why researchers reference social science data in their academic writing. We develop a typology to model relationships between the entities that anchor data references, along with their features (access, actions, locations, styles, types) and functions (critique, describe, illustrate, interact, legitimize). We illustrate the use of the typology by coding multidisciplinary research articles (n=30) referencing social science data archived at the Inter-university Consortium for Political and Social Research (ICPSR). We show how our typology captures researchers' interactions with data and purposes for referencing data. Our typology provides a systematic way to document and analyze researchers' narratives about data use, extending our ability to give credit to data that support research.
翻译:数据重用是社会科学领域的常见实践。虽然已发表数据在社会科学研究中发挥着关键作用,但并未得到一致引用,这使得难以评估其完整的学术影响力,也无法对原始数据生产者给予认可。此外,理解研究人员引用数据的动机也颇具挑战性。与学术文献引用类似,数据引用发挥着多种修辞功能,如表示致敬、表达异议或进行对比。本文研究了研究人员如何在学术写作中引用社会科学数据及其原因。我们开发了一个类型学框架,用于建模锚定数据引用的实体之间的关系及其特征(获取途径、操作行为、位置、风格、类型)和功能(批判、描述、说明、交互、合法化)。通过编码引用跨大学政治与社会研究联盟(ICPSR)存档的社会科学数据的多学科研究论文(n=30),我们展示了该类型学的应用。我们揭示了类型学如何捕捉研究人员与数据的交互过程及其引用数据的目的。该类型学为系统记录和分析研究人员关于数据使用的叙述提供了方法,拓展了我们认可支撑研究的数据的能力。