Calls to make scientific research more open have gained traction with a range of societal stakeholders. Open Science practices include but are not limited to the early sharing of results via preprints and openly sharing outputs such as data and code to make research more reproducible and extensible. Existing evidence shows that adopting Open Science practices has effects in several domains. In this study, we investigate whether adopting one or more Open Science practices leads to significantly higher citations for an associated publication, which is one form of academic impact. We use a novel dataset known as Open Science Indicators, produced by PLOS and DataSeer, which includes all PLOS publications from 2018 to 2023 as well as a comparison group sampled from the PMC Open Access Subset. In total, we analyze circa 122'000 publications. We calculate publication and author-level citation indicators and use a broad set of control variables to isolate the effect of Open Science Indicators on received citations. We show that Open Science practices are adopted to different degrees across scientific disciplines. We find that the early release of a publication as a preprint correlates with a significant positive citation advantage of about 20.2% on average. We also find that sharing data in an online repository correlates with a smaller yet still positive citation advantage of 4.3% on average. However, we do not find a significant citation advantage for sharing code. Further research is needed on additional or alternative measures of impact beyond citations. Our results are likely to be of interest to researchers, as well as publishers, research funders, and policymakers.
翻译:推动科学研究更加开放的呼声已获得社会各界利益相关者的广泛支持。开放科学实践包括但不限于通过预印本提前分享研究成果,以及公开共享数据和代码等产出,以提升研究的可复现性与可扩展性。现有证据表明,采用开放科学实践在多个领域均能产生积极影响。本研究旨在探究采用一项或多项开放科学实践是否会显著提升相关出版物的引文数量——这是学术影响力的一种表现形式。我们采用由PLOS与DataSeer共同构建的新型数据集“开放科学指标”,该数据集涵盖2018年至2023年间所有PLOS出版物,以及从PMC开放获取子集中采样的对照组出版物,总计分析约12.2万篇文献。通过计算出版物层面与作者层面的引文指标,并采用广泛的控制变量集以隔离开放科学指标对引文数量的影响,我们发现:开放科学实践在不同学科领域的采纳程度存在差异。研究显示,以预印本形式提前发布出版物平均可带来约20.2%的显著正向引文优势;在线存储库中共享数据虽影响幅度较小,但仍能产生平均4.3%的正向引文优势。然而,共享代码并未表现出显著的引文优势。未来研究需在引文之外探索更多元化的影响力衡量指标。本研究结果对科研人员、出版商、资助机构及政策制定者均具有重要参考价值。