The rapid adoption of large language models has led to the emergence of AI coding agents that autonomously create pull requests on GitHub. However, how these agents differ in their pull request description characteristics, and how human reviewers respond to them, remains underexplored. In this study, we conduct an empirical analysis of pull requests created by five AI coding agents using the AIDev dataset. We analyze agent differences in pull request description characteristics, including structural features, and examine human reviewer response in terms of review activity, response timing, sentiment, and merge outcomes. We find that AI coding agents exhibit distinct PR description styles, which are associated with differences in reviewer engagement, response time, and merge outcomes. We observe notable variation across agents in both reviewer interaction metrics and merge rates. These findings highlight the role of pull request presentation and reviewer interaction dynamics in human-AI collaborative software development.
翻译:大型语言模型的快速应用催生了能在GitHub上自主创建pull request的AI编程代理。然而,这些代理在pull request描述特征上的差异,以及人工审查者对其的响应方式,目前仍未得到充分探索。本研究基于AIDev数据集,对五种AI编程代理创建的pull request进行了实证分析。我们分析了代理在pull request描述特征(包括结构特征)上的差异,并从审查活动、响应时效、情感倾向与合并结果等维度考察了人工审查者的响应情况。研究发现,AI编程代理展现出具有差异性的PR描述风格,这些风格与审查者参与度、响应时间和合并结果的差异存在关联。我们观察到不同代理在审查者互动指标与合并率方面均存在显著差异。这些发现揭示了pull request呈现方式与审查者互动动态在人机协同软件开发中的重要作用。