A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities

Ibrahim Ethem Hamamci,Sezgin Er,Furkan Almas,Ayse Gulnihan Simsek,Sevval Nil Esirgun,Irem Dogan,Muhammed Furkan Dasdelen,Bastian Wittmann,Enis Simsar,Mehmet Simsar,Emine Bensu Erdemir,Abdullah Alanbay,Anjany Sekuboyina,Berkan Lafci,Mehmet K. Ozdemir,Bjoern Menze

A major challenge in computational research in 3D medical imaging is the lack of comprehensive datasets. Addressing this issue, our study introduces CT-RATE, the first 3D medical imaging dataset that pairs images with textual reports. CT-RATE consists of 25,692 non-contrast chest CT volumes, expanded to 50,188 through various reconstructions, from 21,304 unique patients, along with corresponding radiology text reports. Leveraging CT-RATE, we developed CT-CLIP, a CT-focused contrastive language-image pre-training framework. As a versatile, self-supervised model, CT-CLIP is designed for broad application and does not require task-specific training. Remarkably, CT-CLIP outperforms state-of-the-art, fully supervised methods in multi-abnormality detection across all key metrics, thus eliminating the need for manual annotation. We also demonstrate its utility in case retrieval, whether using imagery or textual queries, thereby advancing knowledge dissemination. The open-source release of CT-RATE and CT-CLIP marks a significant advancement in medical AI, enhancing 3D imaging analysis and fostering innovation in healthcare.

翻译：三维医学影像计算研究面临的主要挑战之一是缺乏全面的数据集。针对这一问题，本研究引入CT-RATE——首个将影像与文本报告配对的三维医学影像数据集。该数据集包含来自21,304名独立患者的25,692个非增强胸部CT容积（通过多种重构扩展至50,188个），同时配有相应的放射学文本报告。基于CT-RATE，我们开发了CT-CLIP——聚焦CT影像的对比语言-影像预训练框架。作为通用型自监督模型，CT-CLIP无需任务特定训练即可广泛应用。值得注意的是，在多异常检测任务的全部关键指标上，CT-CLIP均超越现有最先进的完全监督方法，从而消除了对人工标注的需求。我们还展示了其在病例检索中的应用价值——无论使用影像还是文本查询，均能推动知识传播。CT-RATE与CT-CLIP的开源发布标志着医学人工智能的重要进展，将增强三维影像分析能力并促进医疗保健领域的创新。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日