Graph-based fraud detection (GFD) can be regarded as a challenging semi-supervised node binary classification task. In recent years, Graph Neural Networks(GNN) have been widely applied to GFD, characterizing the anomalous possibility of a node by aggregating neighbor information. However, fraud graphs are inherently heterophilic, thus most of GNNs perform poorly due to their assumption of homophily. In addition, due to the existence of heterophily and class imbalance problem, the existing models do not fully utilize the precious node label information. To address the above issues, this paper proposes a semi-supervised GNN-based fraud detector SEC-GFD. This detector includes a hybrid filtering module and a local environmental constraint module, the two modules are utilized to solve heterophily and label utilization problem respectively. The first module starts from the perspective of the spectral domain, and solves the heterophily problem to a certain extent. Specifically, it divides the spectrum into multiple mixed frequency bands according to the correlation between spectrum energy distribution and heterophily. Then in order to make full use of the node label information, a local environmental constraint module is adaptively designed. The comprehensive experimental results on four real-world fraud detection datasets show that SEC-GFD outperforms other competitive graph-based fraud detectors.
翻译:基于图的欺诈检测(GFD)可视为一项具有挑战性的半监督节点二分类任务。近年来,图神经网络(GNN)被广泛应用于GFD,通过聚合邻居信息刻画节点的异常可能性。然而,欺诈图天生具有异质性,因此大多数GNN因假设同质性而表现不佳。此外,由于异质性和类别不平衡问题的存在,现有模型未能充分利用宝贵的节点标签信息。为解决上述问题,本文提出一种半监督GNN欺诈检测器SEC-GFD。该检测器包含混合滤波模块和局部环境约束模块,分别用于解决异质性和标签利用问题。第一个模块从谱域角度出发,在一定程度上解决了异质性难题:具体而言,根据频谱能量分布与异质性的相关性,将频谱划分为多个混合频带。随后,为充分利用节点标签信息,自适应设计了局部环境约束模块。在四个真实欺诈检测数据集上的综合实验结果表明,SEC-GFD优于其他具有竞争力的图欺诈检测器。