In computer-assisted surgery, automatically recognizing anatomical organs is crucial for understanding the surgical scene and providing intraoperative assistance. While machine learning models can identify such structures, their deployment is hindered by the need for labeled, diverse surgical datasets with anatomical annotations. Labeling multiple classes (i.e., organs) in a surgical scene is time-intensive and requires medical experts. Although synthetically generated images can enhance segmentation performance, maintaining both organ structure and texture during generation is challenging. We introduce a multi-stage approach that uses diffusion models to generate multi-class surgical datasets with annotations. Our framework improves anatomy awareness by training organ-specific models with an inpainting objective guided by binary segmentation masks. The organs are generated with an inference pipeline using a pre-trained ControlNet to preserve organ structure. The synthetic multi-class datasets are constructed through an image composition step, ensuring structural and textural consistency. This versatile approach allows the generation of multi-class datasets from real binary datasets and simulated surgical masks. We thoroughly evaluate the generated datasets on image quality and downstream segmentation, achieving a $15\%$ improvement in segmentation scores when combined with real images. The code is available at https://gitlab.com/nct_tso_public/muli-class-image-synthesis