BrainSegFounder: Towards 3D Foundation Models for Neuroimage Segmentation

The burgeoning field of brain health research increasingly leverages artificial intelligence (AI) to interpret and analyze neurological data. This study introduces a novel approach towards the creation of medical foundation models by integrating a large-scale multi-modal magnetic resonance imaging (MRI) dataset derived from 41,400 participants in its own. Our method involves a novel two-stage pretraining approach using vision transformers. The first stage is dedicated to encoding anatomical structures in generally healthy brains, identifying key features such as shapes and sizes of different brain regions. The second stage concentrates on spatial information, encompassing aspects like location and the relative positioning of brain structures. We rigorously evaluate our model, BrainFounder, using the Brain Tumor Segmentation (BraTS) challenge and Anatomical Tracings of Lesions After Stroke v2.0 (ATLAS v2.0) datasets. BrainFounder demonstrates a significant performance gain, surpassing the achievements of the previous winning solutions using fully supervised learning. Our findings underscore the impact of scaling up both the complexity of the model and the volume of unlabeled training data derived from generally healthy brains, which enhances the accuracy and predictive capabilities of the model in complex neuroimaging tasks with MRI. The implications of this research provide transformative insights and practical applications in healthcare and make substantial steps towards the creation of foundation models for Medical AI. Our pretrained models and training code can be found at https://github.com/lab-smile/GatorBrain.

翻译：脑健康研究这一蓬勃发展的领域日益借助人工智能（AI）来解读和分析神经学数据。本研究通过整合一个源自41,400名参与者的大规模多模态磁共振成像（MRI）数据集，提出了一种创建医学基础模型的新方法。我们的方法采用一种新颖的两阶段预训练策略，使用视觉Transformer。第一阶段专注于编码一般健康大脑的解剖结构，识别不同脑区的形状和大小等关键特征。第二阶段侧重于空间信息，涵盖脑结构的位置和相对空间关系等方面。我们使用脑肿瘤分割挑战赛（BraTS）数据集和脑卒中后病灶解剖追踪v2.0（ATLAS v2.0）数据集对BrainFounder模型进行了严格评估。BrainFounder表现出显著的性能提升，超越了先前完全监督学习的获胜方案。我们的研究结果强调了同时扩展模型复杂性和源自一般健康大脑的无标注训练数据规模的重要性，这提升了模型在复杂MRI神经影像任务中的准确性和预测能力。这项研究的意义为医疗健康领域提供了变革性的见解和实际应用，并为创建医学AI基础模型迈出了重要步伐。我们的预训练模型和训练代码可在 https://github.com/lab-smile/GatorBrain 获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日