A unified cross-attention model for predicting antigen binding specificity to both HLA and TCR molecules

The immune checkpoint inhibitors have demonstrated promising clinical efficacy across various tumor types, yet the percentage of patients who benefit from them remains low. The bindings between tumor antigens and HLA-I/TCR molecules determine the antigen presentation and T-cell activation, thereby playing an important role in the immunotherapy response. In this paper, we propose UnifyImmun, a unified cross-attention transformer model designed to simultaneously predict the bindings of peptides to both receptors, providing more comprehensive evaluation of antigen immunogenicity. We devise a two-phase strategy using virtual adversarial training that enables these two tasks to reinforce each other mutually, by compelling the encoders to extract more expressive features. Our method demonstrates superior performance in predicting both pHLA and pTCR binding on multiple independent and external test sets. Notably, on a large-scale COVID-19 pTCR binding test set without any seen peptide in training set, our method outperforms the current state-of-the-art methods by more than 10\%. The predicted binding scores significantly correlate with the immunotherapy response and clinical outcomes on two clinical cohorts. Furthermore, the cross-attention scores and integrated gradients reveal the amino-acid sites critical for peptide binding to receptors. In essence, our approach marks a significant step toward comprehensive evaluation of antigen immunogenicity.

翻译：免疫检查点抑制剂已在多种肿瘤类型中展现出良好的临床疗效，但能从中获益的患者比例仍然较低。肿瘤抗原与HLA-I/TCR分子之间的结合决定了抗原呈递和T细胞活化，从而在免疫治疗应答中发挥重要作用。本文提出UnifyImmun，一种统一的交叉注意力Transformer模型，旨在同时预测肽段与这两种受体的结合，从而提供更全面的抗原免疫原性评估。我们设计了一种采用虚拟对抗训练的两阶段策略，通过迫使编码器提取更具表达力的特征，使这两个任务能够相互促进。我们的方法在多个独立外部测试集上预测pHLA和pTCR结合均表现出优越性能。值得注意的是，在一个训练集中未出现任何已知肽段的大规模COVID-19 pTCR结合测试集上，我们的方法比当前最先进方法的性能高出10%以上。预测的结合分数与两个临床队列的免疫治疗应答及临床结局显著相关。此外，交叉注意力分数和积分梯度揭示了肽段与受体结合的关键氨基酸位点。本质上，我们的方法标志着向全面评估抗原免疫原性迈出了重要一步。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日