Open digital public infrastructure needs community management to ensure accountability, sustainability, and robustness. Yet open-source projects often rely on centralized decision-making, and the determinants of successful community management remain unclear. We analyze 637 GitHub repositories to trace transitions from founder-led to shared governance. Specifically, we document trajectories to community governance by extracting institutional roles, actions, and deontic cues from version-controlled project constitutions (GOVERNANCE.md files). With a semantic parsing pipeline, we cluster these elements into broader role and action types. We find that roles and actions grow in number and that regulation becomes more balanced, reflecting increases in governance scope and differentiation over time. Rather than shifting tone, communities grow by layering and refining responsibilities. As transitions to community management mature, projects increasingly regulate ecosystem-level relationships and add definition to project oversight roles. Overall, this work offers a scalable pipeline for tracking the growth and development of community governance regimes from open-source software's familiar default of founder ownership.
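To make the idea of extracting deontic cues from successive versions of a GOVERNANCE.md file more concrete, the following is a minimal sketch. It is not the semantic parsing pipeline described above: the cue lexicon, the three categories, and the function names `count_deontic_cues` and `cue_shift` are assumptions introduced purely for illustration, and simple keyword matching stands in for whatever parsing the authors actually use.

```python
import re
from collections import Counter

# Hypothetical cue lexicon (an assumption for illustration, not the paper's).
DEONTIC_CUES = {
    "obligation": ["must", "shall", "is required to", "are required to"],
    "permission": ["may", "can", "is allowed to", "are allowed to"],
    "prohibition": ["must not", "shall not", "may not", "cannot"],
}


def count_deontic_cues(text):
    """Count deontic cues per category in one governance document."""
    remaining = text.lower()
    counts = Counter()
    # Prohibitions are matched first and blanked out, so that "must not"
    # is not counted a second time as the obligation cue "must".
    for category in ("prohibition", "obligation", "permission"):
        cues = sorted(DEONTIC_CUES[category], key=len, reverse=True)
        for cue in cues:
            pattern = r"\b" + re.escape(cue) + r"\b"
            remaining, n = re.subn(pattern, " ", remaining)
            counts[category] += n
    return dict(counts)


def cue_shift(old_text, new_text):
    """Per-category change in cue counts between two document versions."""
    old = count_deontic_cues(old_text)
    new = count_deontic_cues(new_text)
    return {category: new[category] - old[category] for category in DEONTIC_CUES}


if __name__ == "__main__":
    v1 = "Maintainers must review pull requests."
    v2 = (
        "Maintainers must review pull requests. "
        "Contributors may propose changes. "
        "Members must not merge their own code."
    )
    print(count_deontic_cues(v2))
    print(cue_shift(v1, v2))
```

Applied to each committed version of a governance file, counts like these would give a rough time series of how obligations, permissions, and prohibitions are balanced, which is the kind of trajectory the abstract describes at a much finer level of analysis.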