F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation

The Frozen Section (FS) technique is a rapid and efficient method, taking only 15-30 minutes to prepare slides for pathologists' evaluation during surgery, enabling immediate decisions on further surgical interventions. However, FS process often introduces artifacts and distortions like folds and ice-crystal effects. In contrast, these artifacts and distortions are absent in the higher-quality formalin-fixed paraffin-embedded (FFPE) slides, which require 2-3 days to prepare. While Generative Adversarial Network (GAN)-based methods have been used to translate FS to FFPE images (F2F), they may leave morphological inaccuracies with remaining FS artifacts or introduce new artifacts, reducing the quality of these translations for clinical assessments. In this study, we benchmark recent generative models, focusing on GANs and Latent Diffusion Models (LDMs), to overcome these limitations. We introduce a novel approach that combines LDMs with Histopathology Pre-Trained Embeddings to enhance restoration of FS images. Our framework leverages LDMs conditioned by both text and pre-trained embeddings to learn meaningful features of FS and FFPE histopathology images. Through diffusion and denoising techniques, our approach not only preserves essential diagnostic attributes like color staining and tissue morphology but also proposes an embedding translation mechanism to better predict the targeted FFPE representation of input FS images. As a result, this work achieves a significant improvement in classification performance, with the Area Under the Curve rising from 81.99% to 94.64%, accompanied by an advantageous CaseFD. This work establishes a new benchmark for FS to FFPE image translation quality, promising enhanced reliability and accuracy in histopathology FS image analysis. Our work is available at https://minhmanho.github.io/f2f_ldm/.

翻译：冰冻切片（FS）技术是一种快速高效的方法，仅需15-30分钟即可制备病理学家术中评估所需的切片，从而能即时决定后续手术方案。然而，FS过程常引入褶皱、冰晶效应等伪影和形变。相比之下，更高质量的石蜡包埋（FFPE）切片需要2-3天制备，却不存在这些伪影与变形。尽管基于生成对抗网络（GAN）的方法已被用于将FS图像转换为FFPE图像（F2F），但这类方法可能残留FS伪影导致形态学不准确，或引入新伪影，降低了临床评估中翻译图像的质量。本研究对最新生成模型进行基准测试，重点关注GAN和潜扩散模型（LDM）以突破这些局限。我们提出一种创新方法，将LDM与组织病理学预训练嵌入相结合，增强FS图像恢复能力。本框架利用文本和预训练嵌入共同条件化的LDM，学习FS与FFPE组织病理学图像的有意义特征。通过扩散与去噪技术，该方法不仅能保留染色颜色、组织形态等关键诊断属性，还提出一种嵌入翻译机制以更好预测输入FS图像对应的目标FFPE表征。最终，本工作在分类性能上取得显著提升，曲线下面积从81.99%提升至94.64%，并伴随有利的CaseFD指标。本研究为FS到FFPE图像翻译质量建立了新基准，有望提升组织病理学FS图像分析的可靠性与准确性。本研究代码及资源见 https://minhmanho.github.io/f2f_ldm/。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日