A backdoor attack against link prediction tasks with graph neural networks

Graph Neural Networks (GNNs) are a class of deep learning models capable of processing graph-structured data, and they have demonstrated significant performance in a variety of real-world applications. Recent studies have found that GNN models are vulnerable to backdoor attacks. When specific patterns (called backdoor triggers, e.g., subgraphs, nodes, etc.) appear in the input data, the backdoor embedded in the GNN models is activated, which misclassifies the input data into the target class label specified by the attacker, whereas when there are no backdoor triggers in the input, the backdoor embedded in the GNN models is not activated, and the models work normally. Backdoor attacks are highly stealthy and expose GNN models to serious security risks. Currently, research on backdoor attacks against GNNs mainly focus on tasks such as graph classification and node classification, and backdoor attacks against link prediction tasks are rarely studied. In this paper, we propose a backdoor attack against the link prediction tasks based on GNNs and reveal the existence of such security vulnerability in GNN models, which make the backdoored GNN models to incorrectly predict unlinked two nodes as having a link relationship when a trigger appear. The method uses a single node as the trigger and poison selected node pairs in the training graph, and then the backdoor will be embedded in the GNN models through the training process. In the inference stage, the backdoor in the GNN models can be activated by simply linking the trigger node to the two end nodes of the unlinked node pairs in the input data, causing the GNN models to produce incorrect link prediction results for the target node pairs.

翻译：图神经网络（GNNs）是一类能够处理图结构数据的深度学习模型，并在多种实际应用中展现出显著性能。近期研究发现，GNN模型易受后门攻击。当输入数据中出现特定模式（称为后门触发器，如子图、节点等）时，嵌入GNN模型的后门被激活，导致输入数据被误分类为攻击者指定的目标类别标签；而当输入中不包含后门触发器时，嵌入GNN模型的后门不会被激活，模型正常工作。后门攻击具有高度隐蔽性，使GNN模型面临严重安全风险。目前，针对GNN的后门攻击研究主要集中在图分类和节点分类等任务上，而对链路预测任务的后门攻击研究较少。本文提出一种基于GNN的链路预测任务后门攻击方法，揭示了GNN模型中此类安全漏洞的存在——当触发器出现时，被植入后门的GNN模型会将原本无连接的两个节点误判为存在链接关系。该方法采用单个节点作为触发器，在训练图中毒化选定的节点对，通过训练过程将后门嵌入GNN模型。在推理阶段，只需在输入数据中将触发器节点与目标无连接节点对的两个端点相连，即可激活GNN模型中的后门，使模型对目标节点对产生错误的链路预测结果。

相关内容

链路预测

关注 14

网络中的链路预测(Link Prediction)是指如何通过已知的网络节点以及网络结构等信息预测网络中尚未产生连边的两个节点之间产生链接的可能性。这种预测既包含了对未知链接（exist yet unknown links）的预测也包含了对未来链接（future links）的预测。该问题的研究在理论和应用两个方面都具有重要的意义和价值。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日