Recent advances in whole-slide image (WSI) scanners and computational capabilities have significantly propelled the application of artificial intelligence in histopathology slide analysis. While these strides are promising, current supervised learning approaches for WSI analysis come with the challenge of exhaustively labeling high-resolution slides - a process that is both labor-intensive and time-consuming. In contrast, self-supervised learning (SSL) pretraining strategies are emerging as a viable alternative, given that they don't rely on explicit data annotations. These SSL strategies are quickly bridging the performance disparity with their supervised counterparts. In this context, we introduce an SSL framework. This framework aims for transferable representation learning and semantically meaningful clustering by synergizing invariance loss and clustering loss in WSI analysis. Notably, our approach outperforms common SSL methods in downstream classification and clustering tasks, as evidenced by tests on the Camelyon16 and a pancreatic cancer dataset.
翻译:近年来,全切片图像扫描仪和计算能力的显著进步大幅推动了人工智能在组织病理学切片分析中的应用。尽管这些进展令人振奋,但目前用于全切片图像分析的监督学习方法面临一个挑战:对高分辨率切片进行详尽标注既费时又费力。相比之下,自监督学习预训练策略因其不依赖显式数据标注,正逐渐成为一种可行的替代方案。这些自监督学习策略正在迅速缩小其与监督方法的性能差距。在此背景下,我们提出了一种自监督学习框架。该框架通过协同使用不变性损失和聚类损失,在全切片图像分析中实现可迁移的表示学习和语义有意义的聚类。值得注意的是,我们的方法在Camelyon16和胰腺癌数据集上的下游分类与聚类任务中表现优于常见的自监督学习方法。