Hot fixes are urgent, unplanned changes deployed to production systems to address time-critical issues. Despite their importance, no existing evaluation benchmark focuses specifically on hot fixes. We present HotBugs.jar, the first dataset dedicated to real-world hot fixes. From an initial mining of 10 active Apache projects totaling over 190K commits and 150K issue reports, we identified 746 software patches that met our hot-fix criteria. After manual evaluation, 679 were confirmed as genuine hot fixes, of which 110 are reproducible using a test suite. Building upon the Bugs.jar framework, HotBugs.jar integrates these 110 reproducible cases and makes available all 679 manually validated hot fixes, each enriched with comprehensive metadata to support future research. Each hot fix was systematically identified using Jira issue data, validated by independent reviewers, and packaged in a reproducible format with buggy and fixed versions, test suites, and metadata. HotBugs.jar has already been adopted as the official challenge dataset for the Search-Based Software Engineering (SBSE) Conference Challenge Track, demonstrating its immediate impact. This benchmark enables the study and evaluation of tools for rapid debugging, automated repair, and production-grade resilience in modern software systems to drive research in this essential area forward.
翻译:热修复是为解决时效性关键问题而紧急部署至生产系统的非计划性变更。尽管其重要性显著,目前尚无专门针对热修复的评估基准。本文提出HotBugs.jar——首个专注于真实场景热修复的数据集。通过对10个活跃Apache项目的初始挖掘(总计超19万次提交和15万份问题报告),我们识别出746个符合热修复标准的软件补丁。经人工评估后,确认其中679个为真实热修复案例,其中110个可通过测试套件复现。基于Bugs.jar框架构建的HotBugs.jar整合了这110个可复现案例,并提供全部679个经人工验证的热修复数据,每个案例均附有支持未来研究的完整元数据。每个热修复均通过Jira问题数据系统化识别,经独立评审员验证,并以可复现格式封装(包含缺陷版本与修复版本、测试套件及元数据)。HotBugs.jar已被采纳为基于搜索的软件工程(SBSE)会议挑战赛道的官方挑战数据集,彰显了其即时影响力。该基准数据集支持对现代软件系统中快速调试、自动修复及生产级弹性工具的研究与评估,将推动这一关键领域的研究进展。