Many machine learning models require setting a parameter that controls their size before training, e.g., the number of neurons in DNNs or the number of inducing points in GPs. Increasing capacity typically improves performance until all the information in the dataset is captured. Beyond this point, computational cost keeps increasing without any gain in performance. This raises the question "How big is big enough?" We investigate this problem for Gaussian processes (single-layer neural networks) in continual learning. Here, data becomes available incrementally, so the final dataset size is not known before training, which prevents the use of heuristics for setting a fixed model size. We develop a method that automatically adjusts model size while maintaining near-optimal performance. Our experimental procedure follows the constraint that all hyperparameters must be set without access to dataset properties, and we show that our method performs well across diverse datasets without adjusting its hyperparameter, demonstrating that it requires less tuning than other approaches.
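The abstract does not state the actual growth criterion, so the following is only a minimal illustrative sketch of the general idea, not the paper's method: a sparse GP's inducing set is grown during streaming only when an incoming batch contains inputs that the current inducing points explain poorly, measured by the Nyström residual variance. The RBF kernel, the tolerance `tol`, and all helper names are assumptions introduced for illustration.

```python
import numpy as np

def rbf(a, b, lengthscale=1.0):
    # Squared-exponential kernel matrix between the rows of a and b.
    d2 = np.sum(a**2, 1)[:, None] + np.sum(b**2, 1)[None, :] - 2 * a @ b.T
    return np.exp(-0.5 * d2 / lengthscale**2)

def residual_variance(x, Z, jitter=1e-6):
    # k(x,x) - k(x,Z) K_ZZ^{-1} k(Z,x): how poorly the inducing set Z explains x.
    if len(Z) == 0:
        return np.ones(len(x))
    Kzz = rbf(Z, Z) + jitter * np.eye(len(Z))
    Kxz = rbf(x, Z)
    v = np.linalg.solve(np.linalg.cholesky(Kzz), Kxz.T)
    return np.clip(1.0 - np.sum(v**2, axis=0), 0.0, None)

def grow_inducing_set(Z, X_batch, tol=1e-2):
    # Greedily add batch points whose residual variance exceeds tol, so the
    # model only grows when the new batch carries information not yet captured.
    Z = list(Z)
    for x in X_batch:
        Zc = np.array(Z).reshape(-1, X_batch.shape[1])
        if residual_variance(x[None, :], Zc)[0] > tol:
            Z.append(x)
    return np.array(Z).reshape(-1, X_batch.shape[1])

# Streaming usage: batches arrive one at a time; the final dataset size is unknown.
rng = np.random.default_rng(0)
Z = np.empty((0, 1))
for t in range(5):
    X_t = rng.uniform(-3 + t, -1 + t, size=(50, 1))  # drifting input distribution
    Z = grow_inducing_set(Z, X_t)
    print(f"batch {t}: {len(Z)} inducing points")
```

Under this (assumed) criterion, the number of inducing points stops increasing once new batches fall in regions the model already covers, which mirrors the abstract's claim that capacity need only grow until the dataset's information is captured.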