Reproducibility is key to scientific advancement across disciplines, and reducing barriers to open science is a focus area of the Interspeech 2023 theme. Availability of source code is one of the indicators that facilitate reproducibility. However, little is known about the rates of reproducibility at Interspeech conferences in comparison to other conferences in the field. To fill this gap, we surveyed 27,717 papers at seven conferences across speech and language processing disciplines. We find that, despite accepting a similar number of papers to the other conferences, Interspeech has up to 40% less source code availability. In addition to reporting the difficulties we encountered during our research, we provide recommendations and possible directions for increasing reproducibility in future studies.