Graph neural networks (GNNs) have been widely used under semi-supervised settings. Prior studies have mainly focused on finding appropriate graph filters (e.g., aggregation schemes) to generalize well for both homophilic and heterophilic graphs. Even though these approaches are essential and effective, they still suffer from the sparsity in initial node features inherent in the bag-of-words representation. Common in semi-supervised learning where the training samples often fail to cover the entire dimensions of graph filters (hyperplanes), this can precipitate over-fitting of specific dimensions in the first projection matrix. To deal with this problem, we suggest a simple and novel strategy; create additional space by flipping the initial features and hyperplane simultaneously. Training in both the original and in the flip space can provide precise updates of learnable parameters. To the best of our knowledge, this is the first attempt that effectively moderates the overfitting problem in GNN. Extensive experiments on real-world datasets demonstrate that the proposed technique improves the node classification accuracy up to 40.2 %
翻译:图神经网络(GNN)在半监督设置下已被广泛应用。先前的研究主要集中在寻找合适的图滤波器(例如聚合方案)以在同质图和异质图上实现良好的泛化。尽管这些方法至关重要且有效,但它们仍然受到基于词袋表示的初始节点特征稀疏性的影响。在半监督学习中,训练样本通常无法覆盖图滤波器(超平面)的所有维度,这可能导致第一投影矩阵中特定维度的过拟合。为解决这一问题,我们提出一种简单而新颖的策略:通过同时翻转初始特征和超平面来创建额外空间。在原始空间和翻转空间中训练可以精确更新可学习参数。据我们所知,这是首次有效缓解GNN中过拟合问题的尝试。在真实世界数据集上的大量实验表明,所提出技术将节点分类准确率最高提升了40.2%。