Calls to make scientific research more open have gained traction with a range of societal stakeholders. Open Science practices include but are not limited to the early sharing of results via preprints and openly sharing outputs such as data and code to make research more reproducible and extensible. Existing evidence shows that adopting Open Science practices has effects in several domains. In this study, we investigate whether adopting one or more Open Science practices leads to significantly higher citations for an associated publication, which is one form of academic impact. We use a novel dataset known as Open Science Indicators, produced by PLOS and DataSeer, which includes all PLOS publications from 2018 to 2023 as well as a comparison group sampled from the PMC Open Access Subset. In total, we analyze circa 122'000 publications. We calculate publication and author-level citation indicators and use a broad set of control variables to isolate the effect of Open Science Indicators on received citations. We show that Open Science practices are adopted to different degrees across scientific disciplines. We find that the early release of a publication as a preprint correlates with a significant positive citation advantage of about 20.2% on average. We also find that sharing data in an online repository correlates with a smaller yet still positive citation advantage of 4.3% on average. However, we do not find a significant citation advantage for sharing code. Further research is needed on additional or alternative measures of impact beyond citations. Our results are likely to be of interest to researchers, as well as publishers, research funders, and policymakers.
翻译:推动科学研究更加开放的趋势已获得社会各界利益相关方的广泛关注。开放科学实践包括但不限于通过预印本早期分享研究成果,以及公开共享数据、代码等产出,以提升研究的可重复性和可扩展性。现有证据表明,采纳开放科学实践会在多个领域产生效应。本研究旨在探究采纳一项或多项开放科学实践是否会导致相关出版物获得显著更高的引用次数——这是学术影响力的表现形式之一。我们采用PLOS与DataSeer联合构建的新型数据集"开放科学指标",该数据集包含2018年至2023年间PLOS所有出版物,以及从PMC开放获取子集中抽取的对照组样本。我们总计分析了约12.2万篇出版物,计算了出版物层面和作者层面的引文指标,并通过广泛的控制变量集合来分离开放科学指标对引用量的影响效应。研究显示,各学科领域对开放科学实践的采纳程度存在差异。我们发现,以预印本形式提前发布论文与平均约20.2%的显著正向引文优势相关;将数据存储在在线存储库中与平均4.3%的较小但仍显著的正向引文优势相关。然而,我们未发现共享代码能带来显著的引文优势。关于引文之外的其他或替代性影响力指标仍需进一步研究。本研究结果有望为研究人员、出版商、科研资助机构及政策制定者提供重要参考。