AI coding agents are increasingly acting as autonomous contributors by generating and submitting pull requests (PRs). However, we lack empirical evidence on how these agent-generated PRs differ from human contributions, particularly in how they modify code and describe their changes. Understanding these differences is essential for assessing their reliability and impact on development workflows. Using the MSR 2026 Mining Challenge version of the AIDev dataset, we analyze 24,014 merged Agentic PRs (440,295 commits) and 5,081 merged Human PRs (23,242 commits). We examine additions, deletions, commits, and files touched, and evaluate the consistency between PR descriptions and their diffs using lexical and semantic similarity. Agentic PRs differ substantially from Human PRs in commit count (Cliff's $δ= 0.5429$) and show moderate differences in files touched and deleted lines. They also exhibit slightly higher description-to-diff similarity across all measures. These findings provide a large-scale empirical characterization of how AI coding agents contribute to open source development.
翻译:AI编码代理正日益通过生成和提交拉取请求(PRs)成为自主贡献者。然而,我们缺乏关于这些代理生成的PRs与人类贡献有何差异的实证证据,尤其是在其修改代码和描述变更的方式上。理解这些差异对于评估其可靠性及对开发工作流程的影响至关重要。利用AIDev数据集的MSR 2026挖掘挑战版本,我们分析了24,014个已合并的代理PRs(440,295次提交)和5,081个已合并的人类PRs(23,242次提交)。我们考察了新增、删除、提交及涉及的文件,并使用词汇和语义相似度评估了PR描述与其差异文件之间的一致性。代理PRs在提交数量上与人类PRs存在显著差异(Cliff's $δ= 0.5429$),在涉及文件和删除行数上表现出中等程度差异,且在所有度量指标上均呈现略高的描述-差异相似度。这些发现为AI编码代理如何参与开源开发提供了大规模实证特征描述。