Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods towards positive scientific, societal and business impact.
翻译:基于ICML 2023首届DMLR研讨会及前期会议的讨论,本报告阐述了社区参与和基础设施开发对于构建下一代公共数据集以推动机器学习科学发展的重要意义。我们规划了一条通过集体努力持续创建和维护这些数据集及方法的路径,旨在实现积极的科学、社会与商业影响。