This study investigates the robustness of graph embedding methods for community detection in the face of network perturbations, specifically edge deletions. Graph embedding techniques, which represent nodes as low-dimensional vectors, are widely used for various graph machine learning tasks due to their ability to capture structural properties of networks effectively. However, the impact of perturbations on the performance of these methods remains relatively understudied. The research considers state-of-the-art graph embedding methods from two families: matrix factorization (e.g., LE, LLE, HOPE, M-NMF) and random walk-based (e.g., DeepWalk, LINE, node2vec). Through experiments conducted on both synthetic and real-world networks, the study reveals varying degrees of robustness within each family of graph embedding methods. The robustness is found to be influenced by factors such as network size, initial community partition strength, and the type of perturbation. Notably, node2vec and LLE consistently demonstrate higher robustness for community detection across different scenarios, including networks with degree and community size heterogeneity. These findings highlight the importance of selecting an appropriate graph embedding method based on the specific characteristics of the network and the task at hand, particularly in scenarios where robustness to perturbations is crucial.
翻译:本研究探讨了图嵌入方法在面对网络扰动(特别是边删除)时用于社区检测的鲁棒性。图嵌入技术将节点表示为低维向量,因其能有效捕捉网络结构特性而被广泛用于各种图机器学习任务。然而,扰动对这些方法性能的影响仍相对缺乏深入研究。该研究考虑了两类前沿图嵌入方法:矩阵分解法(如LE、LLE、HOPE、M-NMF)和基于随机游走的方法(如DeepWalk、LINE、node2vec)。通过在合成网络和真实网络上的实验,研究发现每类图嵌入方法均表现出不同程度的鲁棒性。鲁棒性受网络规模、初始社区划分强度及扰动类型等因素影响。值得注意的是,node2vec和LLE在不同场景下(包括具有度异质性和社区规模异质性的网络)均表现出更高的社区检测鲁棒性。这些发现凸显了根据网络具体特征和任务需求选择合适图嵌入方法的重要性,特别是在对扰动鲁棒性要求较高的场景中。