Background: Code review, the discussion around a code change among humans, forms a communication network that enables its participants to exchange and spread information. Although reported by qualitative studies, our understanding of the capability of code review as a communication network is still limited. Objective: In this article, we report on a first step towards evaluating the capability of code review as a communication network by quantifying how fast and how far information can spread through code review: the upper bound of information diffusion in code review. Method: In an in-silico experiment, we simulate an artificial information diffusion within large (Microsoft), mid-sized (Spotify), and small code review systems (Trivago) modelled as communication networks. We then measure the minimal topological and temporal distances between the participants to quantify how far and how fast information can spread in code review. Results: An average code review participants in the small and mid-sized code review systems can spread information to between 72% and 85% of all code review participants within four weeks independently of network size and tooling; for the large code review systems, we found an absolute boundary of about 11000 reachable participants. On average (median), information can spread between two participants in code review in less than five hops and less than five days. Conclusion: We found evidence that the communication network emerging from code review scales well and spreads information fast and broadly, corroborating the findings of prior qualitative work. The study lays the foundation for understanding and improving code review as a communication network.
翻译:背景:代码审查——人类围绕代码更改展开的讨论——形成了一种通信网络,使参与者能够交换和传播信息。尽管定性研究已有所报告,但我们对代码审查作为通信网络的能力理解仍然有限。目的:本文通过量化信息通过代码审查扩散的速度和范围,即代码审查中信息扩散的上界,迈出了评估代码审查通信网络能力的第一步。方法:通过计算机模拟实验,我们在建模为通信网络的大型(Microsoft)、中型(Spotify)和小型(Trivago)代码审查系统中模拟人工信息扩散。随后测量参与者之间的最小拓扑距离和时间距离,以量化信息在代码审查中的扩散速度和范围。结果:在小型和中型代码审查系统中,平均而言,无论网络规模或工具如何,参与者可在四周内将信息扩散至72%至85%的代码审查参与者;对于大型代码审查系统,我们发现可到达参与者的绝对边界约为11000人。平均而言(中位数),信息在代码审查中两个参与者之间可在少于5跳且少于5天内扩散。结论:证据表明,代码审查中形成的通信网络具有良好的可扩展性,能够快速且广泛地传播信息,这证实了先前定性研究的发现。本研究为理解并改进作为通信网络的代码审查奠定了基础。