Learning expressive kernels while retaining tractable inference remains a central challenge in scaling Gaussian processes (GPs) to large and complex datasets. We propose a scalable GP regressor based on deep basis kernels (DBKs). Our DBK is constructed from a small set of neural-network-parameterized basis functions with an explicit low-rank structure. This formulation immediately enables inference whose cost is linear in the number of samples, potentially without any inducing points. DBKs provide a unifying perspective that recovers sparse deep kernel learning and Gaussian Bayesian last-layer methods as special cases. We further identify that naively maximizing the marginal likelihood can lead to oversimplified uncertainty and rank-deficient solutions. To address this, we introduce a mini-batch stochastic objective that directly targets the predictive distribution with decoupled regularization. Empirically, DBKs show advantages in predictive accuracy, uncertainty quantification, and computational efficiency across a range of large-scale regression benchmarks.
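The linear-in-samples complexity claim follows from the weight-space view of a low-rank kernel: if k(x, x') = φ(x)ᵀΣφ(x') with m basis functions, exact GP inference reduces to Bayesian linear regression over the m features, costing O(nm²) rather than O(n³). Below is a minimal sketch of that mechanism, with a toy fixed feature map standing in for the learned DBK basis; the function names (`mlp_features`, `dbk_posterior`) and the isotropic prior are illustrative assumptions, not the paper's actual construction or training objective.

```python
import numpy as np

def mlp_features(X, W1, b1, W2, b2):
    """Toy stand-in for the neural-network-parameterized basis phi(x).
    In DBK these weights would be learned; here they are fixed."""
    return np.tanh(np.tanh(X @ W1 + b1) @ W2 + b2)

def dbk_posterior(Phi, y, noise_var=0.1, prior_var=1.0):
    """Exact posterior for the GP with kernel prior_var * phi(x)^T phi(x'),
    computed in weight space: O(n m^2), i.e. linear in the sample count n."""
    n, m = Phi.shape
    # Posterior precision over the m basis-function weights.
    A = Phi.T @ Phi / noise_var + np.eye(m) / prior_var
    L = np.linalg.cholesky(A)
    # Posterior mean weights: A^{-1} Phi^T y / noise_var via two triangular solves.
    mean_w = np.linalg.solve(L.T, np.linalg.solve(L, Phi.T @ y / noise_var))
    return mean_w, L

def dbk_predict(Phi_star, mean_w, L, noise_var=0.1):
    """Predictive mean and variance at test features Phi_star."""
    mu = Phi_star @ mean_w
    # Predictive variance: phi* A^{-1} phi*^T + noise, per test point.
    V = np.linalg.solve(L, Phi_star.T)
    var = np.sum(V**2, axis=0) + noise_var
    return mu, var
```

Note that nothing in the inference path scales worse than linearly in n: the only reduction over the data is the m×m Gram accumulation `Phi.T @ Phi`, which is where mini-batch stochastic objectives like the one proposed here can naturally plug in.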