Open-source software (OSS) has become a valuable resource in both industry and academia over the last few decades. Although OSS communities develop innovative structures to support their projects, those projects and communities have complex needs and face risks such as abandonment. To manage internal social dynamics and community evolution, OSS developer communities have started relying on written governance documents that assign roles and responsibilities to different community actors. To facilitate the study of the impact and effectiveness of formal governance documents on OSS projects and communities, we present a longitudinal dataset of 710 GitHub-hosted OSS projects with \path{GOVERNANCE.MD} governance files. The dataset includes all commits made to each repository, all issues and comments created on GitHub, and all revisions made to the governance file. We hope its availability will foster more research on how OSS communities govern their projects and on the impact of governance files on those communities.