Robotic-assisted tracheal intubation requires the robot to distinguish anatomical features like an experienced physician using deep-learning techniques. However, real datasets of oropharyngeal organs are limited due to patient privacy issues, making it challenging to train deep-learning models for accurate image segmentation. We hereby consider generating a new data modality through a virtual environment to assist the training process. Specifically, this work introduces a virtual dataset generated by the Simulation Open Framework Architecture (SOFA) framework to overcome the limited availability of actual endoscopic images. We also propose a domain adaptive Sim-to-Real method for oropharyngeal organ image segmentation, which employs an image blending strategy called IoU-Ranking Blend (IRB) and style-transfer techniques to address discrepancies between datasets. Experimental results demonstrate the superior performance of the proposed approach with domain adaptive models, improving segmentation accuracy and training stability. In the practical application, the trained segmentation model holds great promise for robot-assisted intubation surgery and intelligent surgical navigation.
翻译:机器人辅助气管插管要求机器人像经验丰富的医生一样,利用深度学习技术区分解剖特征。然而,由于患者隐私问题,口咽器官的真实数据集十分有限,这使得训练深度学习模型实现精确图像分割具有挑战性。本文考虑通过虚拟环境生成一种新的数据模态,以辅助训练过程。具体而言,本研究引入了由仿真开放框架架构(SOFA)框架生成的虚拟数据集,以克服真实内窥镜图像可用性有限的问题。我们还提出了一种领域自适应的仿真到真实(Sim-to-Real)方法,用于口咽器官图像分割,该方法采用了一种称为IoU排序混合(IRB)的图像混合策略以及风格迁移技术,以解决数据集之间的差异。实验结果表明,采用领域自适应模型所提出的方法性能优越,提高了分割精度和训练稳定性。在实际应用中,训练好的分割模型对于机器人辅助插管手术和智能手术导航具有巨大潜力。