Weakly Supervised Lesion Detection and Diagnosis for Breast Cancers with Partially Annotated Ultrasound Images

Deep learning (DL) has proven highly effective for ultrasound-based computer-aided diagnosis (CAD) of breast cancers. In an automaticCAD system, lesion detection is critical for the following diagnosis. However, existing DL-based methods generally require voluminous manually-annotated region of interest (ROI) labels and class labels to train both the lesion detection and diagnosis models. In clinical practice, the ROI labels, i.e. ground truths, may not always be optimal for the classification task due to individual experience of sonologists, resulting in the issue of coarse annotation that limits the diagnosis performance of a CAD model. To address this issue, a novel Two-Stage Detection and Diagnosis Network (TSDDNet) is proposed based on weakly supervised learning to enhance diagnostic accuracy of the ultrasound-based CAD for breast cancers. In particular, all the ROI-level labels are considered as coarse labels in the first training stage, and then a candidate selection mechanism is designed to identify optimallesion areas for both the fully and partially annotated samples. It refines the current ROI-level labels in the fully annotated images and the detected ROIs in the partially annotated samples with a weakly supervised manner under the guidance of class labels. In the second training stage, a self-distillation strategy further is further proposed to integrate the detection network and classification network into a unified framework as the final CAD model for joint optimization, which then further improves the diagnosis performance. The proposed TSDDNet is evaluated on a B-mode ultrasound dataset, and the experimental results show that it achieves the best performance on both lesion detection and diagnosis tasks, suggesting promising application potential.

翻译：深度学习在基于超声的乳腺癌计算机辅助诊断中已展现出高效性。在自动诊断系统中，病灶检测对后续诊断至关重要。然而，现有深度学习方法通常需要大量人工标注的感兴趣区域标签和类别标签来训练病灶检测与诊断模型。临床实践中，由于超声医师个体经验差异，作为真值的感兴趣区域标签可能并非分类任务的最优标注，导致粗标注问题限制了计算机辅助诊断模型的诊断性能。为解决此问题，提出基于弱监督学习的双阶段检测与诊断网络，通过提升乳腺癌超声诊断模型的准确率。具体而言，第一阶段将所有感兴趣区域级标签视为粗标签，并设计候选区域选择机制以确定完全标注样本和部分标注样本中的最优病灶区域。该机制在类别标签引导下，以弱监督方式优化完全标注图像中的现有感兴趣区域标签及部分标注样本中的检测区域。第二阶段进一步提出自蒸馏策略，将检测网络与分类网络整合为统一框架，构建用于联合优化的最终计算机辅助诊断模型，从而进一步提升诊断性能。在B型超声数据集上的实验结果表明，该网络在病灶检测与诊断任务中均取得最优性能，展现出良好的应用潜力。

相关内容

CAD

关注 3

《计算机辅助设计》是一份领先的国际期刊，为学术界和工业界提供有关计算机应用于设计的研究和发展的重要论文。计算机辅助设计邀请论文报告新的研究以及新颖或特别重要的应用，在广泛的主题中，跨越所有阶段的设计过程，从概念创造到制造超越。官网地址：http://dblp.uni-trier.de/db/journals/cad/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日