The growing reliance on Artificial Intelligence (AI) in critical domains such as healthcare demands robust mechanisms to ensure the trustworthiness of these systems, especially when faced with unexpected or anomalous inputs. This paper introduces the Open Medical Imaging Benchmarks for Out-Of-Distribution Detection (OpenMIBOOD), a comprehensive framework for evaluating out-of-distribution (OOD) detection methods specifically in medical imaging contexts. OpenMIBOOD includes three benchmarks from diverse medical domains, encompassing 14 datasets divided into covariate-shifted in-distribution, near-OOD, and far-OOD categories. We evaluate 24 post-hoc methods across these benchmarks, providing a standardized reference to advance the development and fair comparison of OOD detection methods. Results reveal that findings from broad-scale OOD benchmarks in natural image domains do not translate to medical applications, underscoring the critical need for such benchmarks in the medical field. By mitigating the risk of exposing AI models to inputs outside their training distribution, OpenMIBOOD aims to support the advancement of reliable and trustworthy AI systems in healthcare. The repository is available at https://github.com/remic-othr/OpenMIBOOD.