In previous work, we deployed IssueTAG, which uses the texts present in the one-line summary and the description fields of the issue reports to automatically assign them to the stakeholders, who are responsible for resolving the reported issues. Since its deployment on January 12, 2018 at Softtech, i.e., the software subsidiary of the largest private bank in Turkey, IssueTAG has made a total of 301,752 assignments (as of November 2021). One observation we make is that a large fraction of the issue reports submitted to Softtech has screenshot attachments and, in the presence of such attachments, the reports often convey less information in their one-line summary and the description fields, which tends to reduce the assignment accuracy. In this work, we use the screenshot attachments as an additional source of information to further improve the assignment accuracy, which (to the best of our knowledge) has not been studied before in this context. In particular, we develop a number of multi-source (using both the issue reports and the screenshot attachments) and single-source assignment models (using either the issue reports or the screenshot attachments) and empirically evaluate them on real issue reports. In the experiments, compared to the currently deployed single-source model in the field, the best multi-source model developed in this work, significantly (both in the practical and statistical sense) improved the assignment accuracy for the issue reports with screenshot attachments from 0.843 to 0.858 at acceptable overhead costs, a result strongly supporting our basic hypothesis.
翻译:在先前工作中,我们部署了IssueTAG系统,该系统利用问题报告中单行摘要和描述字段的文本内容,自动将问题分配给负责解决报告问题的相关方。自2018年1月12日在土耳其最大私营银行软件子公司Softtech部署以来,截至2021年11月,IssueTAG已累计完成301,752次分配任务。我们观察到,提交至Softtech的大量问题报告均包含屏幕截图附件,且在存在此类附件的情况下,报告的单行摘要和描述字段所传达的信息量往往减少,这倾向于降低分配准确率。在本研究中,我们将屏幕截图附件作为额外信息源以进一步提升分配准确率——据我们所知,此研究方向此前在该领域尚未被探讨。具体而言,我们开发了多种多源(同时使用问题报告与屏幕截图附件)和单源(仅使用问题报告或屏幕截图附件)分配模型,并在真实问题报告上进行了实证评估。实验结果表明,与当前现场部署的单源模型相比,本研究开发的最佳多源模型在可接受的开销成本下,将包含屏幕截图附件的问题报告分配准确率从0.843显著提升至0.858(在实践和统计意义上均具有显著性),这一结果有力支持了我们的基本假设。