Zero-shot capability of SAM-family models for bone segmentation in CT scans

The Segment Anything Model (SAM) and similar models form a family of promptable foundation models (FMs) for image and video segmentation. The object of interest is identified using prompts, such as bounding boxes or points. With these FMs becoming part of medical image segmentation, extensive evaluation studies are required to assess their strengths and weaknesses in clinical settings. Since the performance is highly dependent on the chosen prompting strategy, it is important to investigate different prompting techniques to define optimal guidelines that ensure effective use in medical image segmentation. Currently, no dedicated evaluation studies exist specifically for bone segmentation in CT scans, leaving a gap in understanding the performance for this task. Thus, we use non-iterative, "optimal" prompting strategies composed of bounding boxes, points, and combinations of the two to test the zero-shot capability of SAM-family models for bone CT segmentation on three different skeletal regions. Our results show that the best settings depend on the model type and size, the dataset characteristics, and the objective to optimize. Overall, SAM and SAM2 prompted with a bounding box in combination with the center point for all the components of an object yield the best results across all tested settings. As the results depend on multiple factors, we provide a guideline for informed decision-making in 2D prompting with non-interactive, "optimal" prompts.
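To make the best-performing prompt type concrete, the following is a minimal sketch of how a bounding box plus one center point per connected component could be derived from a 2D ground-truth mask. It is not the authors' code. The function names, the use of 4-connectivity, and the choice of "center" as the component pixel closest to the centroid (so that the point always lies on the object) are assumptions made for illustration.

```python
from collections import deque


def connected_components(mask):
    """Split a 2D binary mask (list of lists) into 4-connected components.

    Returns a list of components, each a list of (row, col) pixels.
    """
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < rows and 0 <= nx < cols:
                        if mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def center_point(pixels):
    """Return the component pixel closest to the centroid, as (x, y).

    Assumption: snapping to a foreground pixel keeps the prompt on the object
    even when the raw centroid of a concave component falls outside it.
    """
    cy = sum(y for y, _ in pixels) / len(pixels)
    cx = sum(x for _, x in pixels) / len(pixels)
    y, x = min(pixels, key=lambda p: (p[0] - cy) ** 2 + (p[1] - cx) ** 2)
    return (x, y)


def box_and_center_prompts(mask):
    """Build one XYXY box around the whole object plus one point per component."""
    components = connected_components(mask)
    if not components:
        return None, []
    ys = [y for comp in components for y, _ in comp]
    xs = [x for comp in components for _, x in comp]
    box = (min(xs), min(ys), max(xs), max(ys))
    points = [center_point(comp) for comp in components]
    return box, points


if __name__ == "__main__":
    # Toy slice: one object made of a 3x3 block and a separate single pixel.
    mask = [
        [1, 1, 1, 0, 0, 0],
        [1, 1, 1, 0, 0, 0],
        [1, 1, 1, 0, 0, 1],
        [0, 0, 0, 0, 0, 0],
    ]
    box, points = box_and_center_prompts(mask)
    print(box)     # (0, 0, 5, 2)
    print(points)  # [(1, 1), (5, 2)]
```

The sketch returns points in (x, y) order and the box in XYXY order, which matches the prompt convention of the original SAM predictor. Passing the prompts to a model is left out here, since the exact call depends on the SAM-family implementation in use.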

