The equitable distribution of academic data is crucial for ensuring equal research opportunities, and ultimately further progress. Yet, due to the complexity of using the API for audio data that corresponds to the Million Song Dataset along with its misreporting (before 2016) and the discontinuation of this API (after 2016), access to this data has become restricted to those within certain affiliations that are connected peer-to-peer. In this paper, we delve into this issue, drawing insights from the experiences of 22 individuals who either attempted to access the data or played a role in its creation. With this, we hope to initiate more critical dialogue and more thoughtful consideration with regard to access privilege in the MIR community.
翻译:学术数据的公平分配对于确保平等的研究机会以及最终的进一步进展至关重要。然而,由于对应百万歌曲数据集的音频数据API使用复杂,加之其2016年前的错误报告问题以及2016年后API的停用,这些数据的访问权限已局限于通过点对点连接的特定机构内部人员。在本文中,我们基于22位试图访问该数据或参与其创建者的经验,深入探讨了这一问题。借此,我们希望能在音乐信息检索(MIR)社区中引发更多关于访问特权的批判性讨论与更深入的思考。