反正则化在参数模型中的收敛性与泛化性分析 (Convergence and Generalization of Anti-Regularization for Parametric Models)

Anti-regularization introduces a reward term with a reversed sign into the loss function, deliberately amplifying model expressivity in small-sample regimes while ensuring that the intervention gradually vanishes as the sample size grows through a power-law decay schedule. We formalize spectral safety conditions and trust-region constraints, and we design a lightweight safeguard that combines a projection operator with gradient clipping to guarantee stable intervention. Theoretical analysis extends to linear smoothers and the Neural Tangent Kernel regime, providing practical guidance on the choice of decay exponents through the balance between empirical risk and variance. Empirical results show that Anti-regularization mitigates underfitting in both regression and classification while preserving generalization and improving calibration. Ablation studies confirm that the decay schedule and safeguards are essential to avoiding overfitting and instability. As an alternative, we also propose a degrees-of-freedom targeting schedule that maintains constant per-sample complexity. Anti-regularization constitutes a simple and reproducible procedure that integrates seamlessly into standard empirical risk minimization pipelines, enabling robust learning under limited data and resource constraints by intervening only when necessary and vanishing otherwise.

翻译：反正则化通过在损失函数中引入符号相反的奖励项，在小样本场景下刻意增强模型表达能力，同时通过幂律衰减机制确保干预效果随样本量增加而逐渐消失。我们形式化地建立了谱安全条件与信赖域约束，并设计了一种结合投影算子与梯度裁剪的轻量级保护机制以保证干预稳定性。理论分析拓展至线性平滑器与神经正切核体系，通过经验风险与方差的权衡为衰减指数的选择提供实践指导。实验结果表明，反正则化在回归与分类任务中均能缓解欠拟合现象，同时保持泛化能力并改善校准效果。消融研究证实衰减机制与保护措施对避免过拟合和不稳定性具有关键作用。作为替代方案，我们同时提出一种保持恒定样本复杂度的自由度目标调度机制。反正则化构成了一种简单且可复现的流程，能够无缝集成至标准经验风险最小化框架中，通过仅在必要时实施干预并在其他情况下自然消退的方式，实现在有限数据与资源约束下的稳健学习。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日