GCondNet: A Novel Method for Improving Neural Networks on Small High-Dimensional Tabular Data

Neural network models often struggle with high-dimensional but small sample-size tabular datasets. One reason is that current weight initialisation methods assume independence between weights, which can be problematic when there are insufficient samples to estimate the model's parameters accurately. In such small data scenarios, leveraging additional structures can improve the model's performance and training stability. To address this, we propose GCondNet, a general approach to enhance neural networks by leveraging implicit structures present in tabular data. We create a graph between samples for each data dimension, and utilise Graph Neural Networks (GNNs) for extracting this implicit structure, and for conditioning the parameters of the first layer of an underlying predictor network. By creating many small graphs, GCondNet exploits the data's high-dimensionality, and thus improves the performance of an underlying predictor network. We demonstrate the effectiveness of our method on 9 real-world datasets, where GCondNet outperforms 15 standard and state-of-the-art methods. The results show that GCondNet is a versatile framework for injecting graph-regularisation into various types of neural networks, including MLPs and tabular Transformers.

翻译：神经网络模型在处理高维但小样本的表格数据集时常常面临挑战。其中一个原因是当前的权重初始化方法假设权重之间相互独立，这在样本量不足以准确估计模型参数时会引发问题。在这种小样本场景中，利用额外结构可以提升模型性能与训练稳定性。为此，我们提出GCondNet——一种通过利用表格数据中隐含结构来增强神经网络的通用方法。我们针对每个数据维度构建样本间的图，并利用图神经网络（GNNs）提取这种隐含结构，进而对底层预测网络首层参数进行条件化处理。通过构建大量小型图，GCondNet充分利用了数据的高维特性，从而提升了底层预测网络的性能。我们在9个真实数据集上验证了该方法的有效性，结果显示GCondNet优于15种标准方法与最新技术。结果表明，GCondNet是一种可将图正则化注入各类神经网络（包括MLPs与表格Transformer）的通用框架。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日