In the context of deep learning research, where model introductions continually occur, the need for effective and efficient evaluation remains paramount. Existing methods often emphasize accuracy metrics, overlooking stability. To address this, the paper introduces the Accuracy-Stability Index (ASI), a quantitative measure incorporating both accuracy and stability for assessing deep learning models. Experimental results demonstrate the application of ASI, and a 3D surface model is presented for visualizing ASI, mean accuracy, and coefficient of variation. This paper addresses the important issue of quantitative benchmarking metrics for deep learning models, providing a new approach for accurately evaluating accuracy and stability of deep learning models. The paper concludes with discussions on potential weaknesses and outlines future research directions.
翻译:在深度学习研究中,随着新模型的不断涌现,有效且高效的评估方法仍至关重要。现有方法常侧重于准确度指标,而忽视了稳定性。为解决这一问题,本文提出准确-稳定度指数(Accuracy-Stability Index, ASI),这是一种同时考量准确度与稳定性的定量评估指标。实验结果表明了ASI的应用效果,并引入3D曲面模型以可视化ASI、平均准确度及变异系数。本文针对深度学习模型定量基准指标这一重要问题,为准确评估模型的准确度与稳定性提供了新思路。最后,文章讨论了潜在不足并展望了未来研究方向。