CommitBench: A Benchmark for Commit Message Generation

Writing commit messages is a tedious daily task for many software developers, and is often neglected. Automating this task has the potential to save time while ensuring that messages are informative. A high-quality dataset and an objective benchmark are vital preconditions for solid research and evaluation towards this goal. We show that existing datasets exhibit various problems, such as the quality of the commit selection, small sample sizes, duplicates, privacy issues, and missing licenses for redistribution. This can lead to unusable models and skewed evaluations, where inferior models achieve higher evaluation scores due to biases in the data. We compile a new large-scale dataset, CommitBench, adopting best practices for dataset creation. We sample commits from diverse projects with licenses that permit redistribution and apply our filtering and dataset enhancements to improve the quality of generated commit messages. We use CommitBench to compare existing models and show that other approaches are outperformed by a Transformer model pretrained on source code. We hope to accelerate future research by publishing the source code (https://github.com/Maxscha/commitbench).

