度量革命：生物医学图像分割度量实施的开创性见解 (Metrics Revolutions: Groundbreaking Insights into the Implementation of Metrics for Biomedical Image Segmentation)

The evaluation of segmentation performance is a common task in biomedical image analysis, with its importance emphasized in the recently released metrics selection guidelines and computing frameworks. To quantitatively evaluate the alignment of two segmentations, researchers commonly resort to counting metrics, such as the Dice similarity coefficient, or distance-based metrics, such as the Hausdorff distance, which are usually computed by publicly available open-source tools with an inherent assumption that these tools provide consistent results. In this study we questioned this assumption, and performed a systematic implementation analysis along with quantitative experiments on real-world clinical data to compare 11 open-source tools for distance-based metrics computation against our highly accurate mesh-based reference implementation. The results revealed that statistically significant differences among all open-source tools are both surprising and concerning, since they question the validity of existing studies. Besides identifying the main sources of variation, we also provide recommendations for distance-based metrics computation.

翻译：分割性能评估是生物医学图像分析中的常见任务，其重要性在近期发布的度量选择指南与计算框架中得到强调。为定量评估两个分割结果的对齐程度，研究者通常采用计数型度量（如Dice相似系数）或距离型度量（如豪斯多夫距离），这些度量一般通过公开开源工具计算，并隐含假设这些工具能提供一致结果。本研究对此假设提出质疑，通过系统性的实现分析与真实临床数据的定量实验，将11种距离型度量计算开源工具与我们高精度网格化参考实现进行对比。结果表明所有开源工具间存在统计学显著差异，这一现象既令人惊讶也值得警惕，因其对现有研究的有效性提出了根本性质疑。除识别主要变异来源外，本研究还提出了距离型度量计算的实施建议。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日