BrainNPT: Pre-training of Transformer networks for brain network classification

Deep learning methods have advanced quickly in brain imaging analysis over the past few years, but they are usually restricted by the limited labeled data. Pre-trained model on unlabeled data has presented promising improvement in feature learning in many domains, including natural language processing and computer vision. However, this technique is under-explored in brain network analysis. In this paper, we focused on pre-training methods with Transformer networks to leverage existing unlabeled data for brain functional network classification. First, we proposed a Transformer-based neural network, named as BrainNPT, for brain functional network classification. The proposed method leveraged <cls> token as a classification embedding vector for the Transformer model to effectively capture the representation of brain network. Second, we proposed a pre-training framework for BrainNPT model to leverage unlabeled brain network data to learn the structure information of brain networks. The results of classification experiments demonstrated the BrainNPT model without pre-training achieved the best performance with the state-of-the-art models, and the BrainNPT model with pre-training strongly outperformed the state-of-the-art models. The pre-training BrainNPT model improved 8.75% of accuracy compared with the model without pre-training. We further compared the pre-training strategies, analyzed the influence of the parameters of the model, and interpreted the trained model.

翻译：近年来，深度学习方法在脑影像分析领域取得了快速进展，但通常受限于有限的标注数据。在自然语言处理和计算机视觉等多个领域，基于无标注数据的预训练模型已在特征学习方面展现出显著提升。然而，该技术在脑网络分析中尚未得到充分探索。本文聚焦于利用Transformer网络的预训练方法，以充分利用现有无标注数据进行脑功能网络分类。首先，我们提出了一种名为BrainNPT的Transformer神经网络，用于脑功能网络分类。该方法利用<cls>标记作为Transformer模型的分类嵌入向量，有效捕获脑网络的表征。其次，我们为BrainNPT模型设计了预训练框架，利用无标注脑网络数据学习其结构信息。分类实验结果表明：未预训练的BrainNPT模型达到了与现有最优模型相当的性能，而经过预训练的BrainNPT模型则显著超越了现有最优模型。与未预训练模型相比，预训练BrainNPT模型的准确率提升了8.75%。此外，我们还进一步比较了不同预训练策略，分析了模型参数的影响，并对训练后的模型进行了解释。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

语言视觉预训练语言模型揭密，Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

专知会员服务

36+阅读 · 2020年5月20日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日