Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction - 专知论文

会员服务 ·

0

损失 · 结构 · 结构化 · 损失函数 · 一致 ·

Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction

翻译：线性核心代理：分类与结构化预测中具有线性速率的光滑损失函数

Mehryar Mohri,Yutao Zhong

The choice of loss function in classification involves a fundamental trade-off: smooth losses (like Cross-Entropy) enable fast optimization rates but yield slow square-root consistency bounds, while piecewise-linear losses (like Hinge) offer fast linear consistency rates but suffer from non-differentiability. We propose Linear-Core (LC) Surrogates, a new family of convex loss functions that resolve this tension by stitching a linear core to a smooth tail. We prove that these surrogates are differentiable everywhere while retaining strict linear $H$-consistency bounds, effectively combining the optimization benefits of smoothness with the statistical efficiency of margin-based losses. In the structured prediction setting, we show that this smoothness unlocks a massive computational and energy advantage: it allows for an unbiased stochastic gradient estimator that bypasses the quadratic complexity $O(|\mathscr{Y}|^2)$ of exact inference (e.g., Viterbi). Empirically, our method achieves a 23$\times$ speedup over Structured SVMs on large-vocabulary sequence tagging tasks and demonstrates superior robustness to instance-dependent label noise, outperforming Cross-Entropy by 2.6% on corrupted CIFAR-10.

翻译：损失函数的选择在分类中涉及一个基本权衡：光滑损失（如交叉熵）能够实现快速的优化速率，但产生缓慢的平方根一致性边界；而分段线性损失（如合页损失）提供快速的线性一致性速率，却面临不可微性问题。我们提出线性核心（Linear-Core, LC）代理函数，这是一类新的凸损失函数族，通过将线性核心与光滑尾部拼接来解决这一矛盾。我们证明这些代理函数在保持严格线性$H$-一致性边界的同时处处可微，有效结合了光滑性的优化优势与基于间隔损失的统计效率。在结构化预测场景中，我们展示了这种光滑性带来了巨大的计算和能量优势：它允许一种无偏随机梯度估计器，绕过了精确推理（如维特比算法）的二次复杂度$O(|\mathscr{Y}|^2)$。实验上，我们的方法在大词汇量序列标注任务上比结构化支持向量机实现了23倍的加速，并在对实例相关标签噪声表现出优越的鲁棒性，在受损CIFAR-10数据集上比交叉熵高出2.6%。

0

相关内容

【2023新书】光滑流形上的优化引论，368页pdf

【2023新书】光滑流形上的优化引论，368页pdf

专知会员服务

56+阅读 · 2023年8月7日

如何用机器学习损失函数？最新《机器学习损失函数》综述，详述其33个损失函数与分类法

如何用机器学习损失函数？最新《机器学习损失函数》综述，详述其33个损失函数与分类法

专知会员服务

70+阅读 · 2023年1月17日

机器学习损失函数概述，Loss Functions in Machine Learning

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

【EPFL-Nicolas Boumal新书】光滑流形优化导论，362页pdf，An introduction to optimization on smooth manifolds

【EPFL-Nicolas Boumal新书】光滑流形优化导论，362页pdf，An introduction to optimization on smooth manifolds

专知会员服务

34+阅读 · 2022年3月4日

【NeurIPS 2021 】为目标检测搜索参数化平均准确率损失函数

【NeurIPS 2021 】为目标检测搜索参数化平均准确率损失函数

专知会员服务

19+阅读 · 2021年12月12日

ICCV'21 Oral｜拒绝调参，显著提点！检测分割任务的新损失函数RS Loss开源

专知会员服务

16+阅读 · 2021年8月11日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

【经典书】线性代数，Linear Algebra，525页pdf

【经典书】线性代数，Linear Algebra，525页pdf

专知会员服务

79+阅读 · 2021年1月29日

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

专知会员服务

44+阅读 · 2020年3月4日

【电子书|交互式线性代数】《Interactive Linear Algebra》by Dan Margalit, Joseph Rabinoff（附455页pdf）

【电子书|交互式线性代数】《Interactive Linear Algebra》by Dan Margalit, Joseph Rabinoff（附455页pdf）

专知会员服务

69+阅读 · 2019年11月30日

【干货书】机器学习线性代数与优化，507页pdf

【干货书】机器学习线性代数与优化，507页pdf

专知

23+阅读 · 2022年7月28日

一文看尽15种语义分割损失函数（含代码解析）

一文看尽15种语义分割损失函数（含代码解析）

CVer

82+阅读 · 2020年7月2日

一文读懂线性回归、岭回归和Lasso回归

一文读懂线性回归、岭回归和Lasso回归

CSDN

34+阅读 · 2019年10月13日

【机器学习】一文读懂线性回归、岭回归和Lasso回归

【机器学习】一文读懂线性回归、岭回归和Lasso回归

AINLP

20+阅读 · 2019年10月12日

【学界】CVPR 2019 | 旷视研究院提出新型损失函数：改善边界框模糊问题

【学界】CVPR 2019 | 旷视研究院提出新型损失函数：改善边界框模糊问题

GAN生成式对抗网络

14+阅读 · 2019年5月20日

从信息论的角度来理解损失函数

从信息论的角度来理解损失函数

深度学习每日摘要

17+阅读 · 2019年4月7日

那些值得推荐和收藏的线性代数学习资源

那些值得推荐和收藏的线性代数学习资源

AINLP

25+阅读 · 2019年3月6日

换个角度看GAN：另一种损失函数

换个角度看GAN：另一种损失函数

机器之心

16+阅读 · 2019年1月1日

详解常见的损失函数

详解常见的损失函数

七月在线实验室

20+阅读 · 2018年7月12日

【干货】深度学习中的线性代数

【干货】深度学习中的线性代数

专知

21+阅读 · 2018年3月30日

删失数据超高维共线性模型的变量选择

国家自然科学基金

0+阅读 · 2017年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

密码函数二阶非线性度快速算法及其紧下界研究

国家自然科学基金

0+阅读 · 2015年12月31日

非光滑非凸优化问题的交替线性化算法及其应用

国家自然科学基金

6+阅读 · 2015年12月31日

有限域上的代数曲线在纠错码构造中的几点应用

国家自然科学基金

0+阅读 · 2015年12月31日

光滑函数类的熵数估计

国家自然科学基金

0+阅读 · 2015年12月31日

生成函数运算下细分光滑性变化规律研究

国家自然科学基金

0+阅读 · 2015年12月31日

套代数框架下时变线性系统的同时稳定化

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下约束线性模型的有偏估计及变量选择研究

国家自然科学基金

0+阅读 · 2014年12月31日

求解非线性方程的加速迭代算法

国家自然科学基金

0+阅读 · 2014年12月31日

Black-box optimization of noisy functions with unknown smoothness

Arxiv

0+阅读 · 5月4日

Fast and Exact: Asymptotically Linear KL-Optimal Frequency Normalization

Arxiv

0+阅读 · 5月1日

A Kernel Score Perspective on Forecast Disagreement and the Linear Pool

Arxiv

0+阅读 · 4月29日

Plotkin-like Bound and Explicit Function-Correcting Code Constructions for Lee Metric Channels

Arxiv

0+阅读 · 4月28日

Distributional Robustness of Linear Contracts

Arxiv

0+阅读 · 4月27日

Isotonic Layer: A Unified Framework for Recommendation Calibration and Debiasing

Arxiv

0+阅读 · 4月27日

Universal, sample-optimal algorithms for recovery of anisotropic functions from i.i.d. samples

Arxiv

0+阅读 · 4月8日

On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains

Arxiv

0+阅读 · 4月7日

On Representability of Multiple-Valued Functions by Linear Lambda Terms Typed with Second-order Polymorphic Type System

Arxiv

0+阅读 · 3月27日

On Representability of Multiple-Valued Functions by Linear Lambda Terms Typed with Second-order Polymorphic Type System

Arxiv

0+阅读 · 3月26日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

1+阅读 · 今天14:45

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

2+阅读 · 今天14:43

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

4+阅读 · 今天14:31

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 今天14:20

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

3+阅读 · 今天14:11

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

3+阅读 · 今天14:07

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

3+阅读 · 今天14:03

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

2+阅读 · 今天13:59

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

5+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

7+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

8+阅读 · 6月22日

相关VIP内容

【2023新书】光滑流形上的优化引论，368页pdf

【2023新书】光滑流形上的优化引论，368页pdf

专知会员服务

56+阅读 · 2023年8月7日

如何用机器学习损失函数？最新《机器学习损失函数》综述，详述其33个损失函数与分类法

如何用机器学习损失函数？最新《机器学习损失函数》综述，详述其33个损失函数与分类法

专知会员服务

70+阅读 · 2023年1月17日

机器学习损失函数概述，Loss Functions in Machine Learning

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

【EPFL-Nicolas Boumal新书】光滑流形优化导论，362页pdf，An introduction to optimization on smooth manifolds

【EPFL-Nicolas Boumal新书】光滑流形优化导论，362页pdf，An introduction to optimization on smooth manifolds

专知会员服务

34+阅读 · 2022年3月4日

【NeurIPS 2021 】为目标检测搜索参数化平均准确率损失函数

【NeurIPS 2021 】为目标检测搜索参数化平均准确率损失函数

专知会员服务

19+阅读 · 2021年12月12日

ICCV'21 Oral｜拒绝调参，显著提点！检测分割任务的新损失函数RS Loss开源

专知会员服务

16+阅读 · 2021年8月11日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

【经典书】线性代数，Linear Algebra，525页pdf

【经典书】线性代数，Linear Algebra，525页pdf

专知会员服务

79+阅读 · 2021年1月29日

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

专知会员服务

44+阅读 · 2020年3月4日

【电子书|交互式线性代数】《Interactive Linear Algebra》by Dan Margalit, Joseph Rabinoff（附455页pdf）

【电子书|交互式线性代数】《Interactive Linear Algebra》by Dan Margalit, Joseph Rabinoff（附455页pdf）

专知会员服务

69+阅读 · 2019年11月30日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 世界动作模型：少做梦，多行动

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

美以伊冲突：无人机与人工智能的运用

相关资讯

【干货书】机器学习线性代数与优化，507页pdf

【干货书】机器学习线性代数与优化，507页pdf

专知

23+阅读 · 2022年7月28日

一文看尽15种语义分割损失函数（含代码解析）

一文看尽15种语义分割损失函数（含代码解析）

CVer

82+阅读 · 2020年7月2日

一文读懂线性回归、岭回归和Lasso回归

一文读懂线性回归、岭回归和Lasso回归

CSDN

34+阅读 · 2019年10月13日

【机器学习】一文读懂线性回归、岭回归和Lasso回归

【机器学习】一文读懂线性回归、岭回归和Lasso回归

AINLP

20+阅读 · 2019年10月12日

【学界】CVPR 2019 | 旷视研究院提出新型损失函数：改善边界框模糊问题

【学界】CVPR 2019 | 旷视研究院提出新型损失函数：改善边界框模糊问题

GAN生成式对抗网络

14+阅读 · 2019年5月20日

从信息论的角度来理解损失函数

从信息论的角度来理解损失函数

深度学习每日摘要

17+阅读 · 2019年4月7日

那些值得推荐和收藏的线性代数学习资源

那些值得推荐和收藏的线性代数学习资源

AINLP

25+阅读 · 2019年3月6日

换个角度看GAN：另一种损失函数

换个角度看GAN：另一种损失函数

机器之心

16+阅读 · 2019年1月1日

详解常见的损失函数

详解常见的损失函数

七月在线实验室

20+阅读 · 2018年7月12日

【干货】深度学习中的线性代数

【干货】深度学习中的线性代数

专知

21+阅读 · 2018年3月30日

相关论文

Black-box optimization of noisy functions with unknown smoothness

Arxiv

0+阅读 · 5月4日

Fast and Exact: Asymptotically Linear KL-Optimal Frequency Normalization

Arxiv

0+阅读 · 5月1日

A Kernel Score Perspective on Forecast Disagreement and the Linear Pool

Arxiv

0+阅读 · 4月29日

Plotkin-like Bound and Explicit Function-Correcting Code Constructions for Lee Metric Channels

Arxiv

0+阅读 · 4月28日

Distributional Robustness of Linear Contracts

Arxiv

0+阅读 · 4月27日

Isotonic Layer: A Unified Framework for Recommendation Calibration and Debiasing

Arxiv

0+阅读 · 4月27日

Universal, sample-optimal algorithms for recovery of anisotropic functions from i.i.d. samples

Arxiv

0+阅读 · 4月8日

On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains

Arxiv

0+阅读 · 4月7日

On Representability of Multiple-Valued Functions by Linear Lambda Terms Typed with Second-order Polymorphic Type System

Arxiv

0+阅读 · 3月27日

On Representability of Multiple-Valued Functions by Linear Lambda Terms Typed with Second-order Polymorphic Type System

Arxiv

0+阅读 · 3月26日

相关基金

删失数据超高维共线性模型的变量选择

国家自然科学基金

0+阅读 · 2017年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

密码函数二阶非线性度快速算法及其紧下界研究

国家自然科学基金

0+阅读 · 2015年12月31日

非光滑非凸优化问题的交替线性化算法及其应用

国家自然科学基金

6+阅读 · 2015年12月31日

有限域上的代数曲线在纠错码构造中的几点应用

国家自然科学基金

0+阅读 · 2015年12月31日

光滑函数类的熵数估计

国家自然科学基金

0+阅读 · 2015年12月31日

生成函数运算下细分光滑性变化规律研究

国家自然科学基金

0+阅读 · 2015年12月31日

套代数框架下时变线性系统的同时稳定化

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下约束线性模型的有偏估计及变量选择研究

国家自然科学基金

0+阅读 · 2014年12月31日

求解非线性方程的加速迭代算法

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员