Graph-based fraud detection (GFD) can be regarded as a challenging semi-supervised node binary classification task. In recent years, Graph Neural Networks (GNN) have been widely applied to GFD, characterizing the anomalous possibility of a node by aggregating neighbor information. However, fraud graphs are inherently heterophilic, thus most of GNNs perform poorly due to their assumption of homophily. In addition, due to the existence of heterophily and class imbalance problem, the existing models do not fully utilize the precious node label information. To address the above issues, this paper proposes a semi-supervised GNN-based fraud detector SEC-GFD. This detector includes a hybrid filtering module and a local environmental constraint module, the two modules are utilized to solve heterophily and label utilization problem respectively. The first module starts from the perspective of the spectral domain, and solves the heterophily problem to a certain extent. Specifically, it divides the spectrum into various mixed-frequency bands based on the correlation between spectrum energy distribution and heterophily. Then in order to make full use of the node label information, a local environmental constraint module is adaptively designed. The comprehensive experimental results on four real-world fraud detection datasets denote that SEC-GFD outperforms other competitive graph-based fraud detectors. We release our code at https://github.com/Sunxkissed/SEC-GFD.
翻译:基于图的欺诈检测(GFD)可视为一项具有挑战性的半监督节点二分类任务。近年来,图神经网络(GNN)被广泛用于GFD,通过聚合邻居信息来表征节点的异常可能性。然而,欺诈图本质上具有异质性,因此大多数GNN因其同质性假设而表现不佳。此外,由于异质性和类别不平衡问题的存在,现有模型未能充分利用宝贵的节点标签信息。为解决上述问题,本文提出一种基于半监督GNN的欺诈检测器SEC-GFD。该检测器包含混合滤波模块和局部环境约束模块,分别用于解决异质性和标签利用问题。第一模块从频谱域角度出发,在一定程度上解决了异质性问题。具体而言,它根据频谱能量分布与异质性之间的相关性,将频谱划分为多种混合频带。随后,为充分利用节点标签信息,自适应设计了局部环境约束模块。在四个真实世界欺诈检测数据集上的综合实验结果表明,SEC-GFD优于其他具有竞争力的基于图的欺诈检测器。我们已在https://github.com/Sunxkissed/SEC-GFD发布代码。