We investigate random feature models in which neural networks sampled from a prescribed initialization ensemble are frozen and used as random features, with only the readout weights optimized. Adopting a statistical-physics viewpoint, we study the training error, test error, and generalization gap beyond the mean kernel approximation. Since the predictor is a nonlinear functional of the induced random kernel, the ensemble-averaged errors depend not only on the mean kernel but also on higher-order fluctuation statistics. Within an effective field-theoretic framework, these finite-width contributions naturally appear as loop corrections. We derive loop corrections to the training error, test error, and generalization gap, obtain their scaling laws, and support the theory with experimental verification.
翻译:我们研究随机特征模型,其中从预设初始化系综中采样的神经网络被冻结并用作随机特征,仅优化读出权重。采用统计物理视角,我们研究平均核近似之外的训练误差、测试误差和泛化差距。由于预测器是诱导随机核的非线性函数,系综平均误差不仅取决于平均核,还取决于高阶涨落统计量。在有效场论框架内,这些有限宽度贡献自然表现为循环修正。我们推导了训练误差、测试误差和泛化差距的循环修正,获得了其标度律,并通过实验验证支持了该理论。