Provable Benefit of Mixup for Finding Optimal Decision Boundaries - 专知论文

会员服务 ·

0

优化器 · Mixup · 样本复杂度 · 分离的 · 模型评估 ·

2023 年 6 月 6 日

Provable Benefit of Mixup for Finding Optimal Decision Boundaries

翻译：混合数据增强方法在寻找最优决策边界中的可证优势

Junsoo Oh,Chulhee Yun

from arxiv, ICML 2023 camera-ready version; 48 pages

We investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant $\kappa$, we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For vanilla training without augmentation, we uncover an interesting phenomenon named the curse of separability. As we increase $\kappa$ to make the data distribution more separable, the sample complexity of vanilla training increases exponentially in $\kappa$; perhaps surprisingly, the task of finding optimal decision boundaries becomes harder for more separable distributions. For Mixup training, we show that Mixup mitigates this problem by significantly reducing the sample complexity. To this end, we develop new concentration results applicable to $n^2$ pair-wise augmented data points constructed from $n$ independent data, by carefully dealing with dependencies between overlapping pairs. Lastly, we study other masking-based Mixup-style techniques and show that they can distort the training loss and make its minimizer converge to a suboptimal classifier in terms of test accuracy.

翻译：我们研究成对数据增强技术（如Mixup）如何影响二元线性分类问题中寻找最优决策边界的样本复杂度。针对具有可分性常数$\kappa$的一类数据分布，我们分析了训练损失最优分类器与测试准确率最优分类器（即贝叶斯最优分类器）的对齐程度。在未使用数据增强的标准训练中，我们揭示了一个名为"可分性诅咒"的有趣现象：随着$\kappa$增大使数据分布更具可分性，标准训练的样本复杂度随$\kappa$呈指数级增长；令人惊讶的是，对于更具可分性的分布，找到最优决策边界的任务反而变得更加困难。对于Mixup训练，我们证明Mixup通过显著降低样本复杂度缓解了这一问题。为此，我们开发了适用于由$n$个独立数据构造的$n^2$个成对增强数据点的新浓度结果，并谨慎处理了重叠对之间的依赖关系。最后，我们研究了其他基于掩码的Mixup风格技术，发现它们可能扭曲训练损失，使其最小化器在测试准确率上收敛至次优分类器。

0

相关内容

优化器

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

空间分数阶质量守恒型Allen-Cahn方程的高效数值算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

随机辛算法和多辛算法

国家自然科学基金

2+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

负载型过渡金属碳化物催化剂的合成、表征及在染料敏化太阳能电池中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

随机泛函微分方程的适定性与渐近性分析

国家自然科学基金

0+阅读 · 2012年12月31日

不可靠通信环境下复杂动态网络状态估计与故障诊断

国家自然科学基金

0+阅读 · 2012年12月31日

WRKY类转录因子在托品烷类生物碱生物合成中的调控作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

多智能体不确定性系统的自适应一致性问题研究

国家自然科学基金

6+阅读 · 2012年12月31日

以带隙可调的Zn(O,S)梯度薄膜为缓层的CuInS2薄膜太阳能电池研究

国家自然科学基金

1+阅读 · 2009年12月31日

ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

Arxiv

0+阅读 · 2023年7月28日

f-Divergence Minimization for Sequence-Level Knowledge Distillation

Arxiv

0+阅读 · 2023年7月27日

From Contextual Data to Newsvendor Decisions: On the Actual Performance of Data-Driven Algorithms

Arxiv

1+阅读 · 2023年7月27日

Simplified Concrete Dropout -- Improving the Generation of Attribution Masks for Fine-grained Classification

Arxiv

0+阅读 · 2023年7月27日

A Verified Efficient Implementation of the Weighted Path Order

Arxiv

0+阅读 · 2023年7月27日

Contrastive Domain Adaptation for Time-Series via Temporal Mixup

Arxiv

0+阅读 · 2023年7月27日

On the Generalization Effects of Linear Transformations in Data Augmentation

Arxiv

0+阅读 · 2023年7月26日

Memory-Efficient Graph Convolutional Networks for Object Classification and Detection with Event Cameras

Arxiv

0+阅读 · 2023年7月26日

Efficient Estimation of the Local Robustness of Machine Learning Models

Arxiv

0+阅读 · 2023年7月26日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

VIP会员

文章信息

相关主题

样本复杂度

最新内容

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

专知会员服务

3+阅读 · 今天8:18

《面向导弹有效发射时机的监督机器学习方法：基于超视距空战仿真》

《面向导弹有效发射时机的监督机器学习方法：基于超视距空战仿真》

专知会员服务

3+阅读 · 今天7:39

《通用大语言模型：无人机指挥与控制接口》最新40页

《通用大语言模型：无人机指挥与控制接口》最新40页

专知会员服务

7+阅读 · 今天7:33

《通过小型无人机系统将情报能力“作战化”》

《通过小型无人机系统将情报能力“作战化”》

专知会员服务

3+阅读 · 今天7:28

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

专知会员服务

4+阅读 · 今天7:14

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

专知会员服务

18+阅读 · 6月15日

消耗优势：美军的“精确规模化”概念

消耗优势：美军的“精确规模化”概念

专知会员服务

7+阅读 · 6月15日

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

专知会员服务

8+阅读 · 6月15日

《网络空间兵棋推演：挑战、局限性与混合路径》报告

《网络空间兵棋推演：挑战、局限性与混合路径》报告

专知会员服务

8+阅读 · 6月15日

《离线语言支持系统：面向空战战术决策》

《离线语言支持系统：面向空战战术决策》

专知会员服务

8+阅读 · 6月15日

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

专知会员服务

6+阅读 · 6月15日

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

专知会员服务

6+阅读 · 6月14日

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

专知会员服务

6+阅读 · 6月14日

俄乌战场地面机器人如何改写战争规则

俄乌战场地面机器人如何改写战争规则

专知会员服务

9+阅读 · 6月14日

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

专知会员服务

13+阅读 · 6月14日

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向导弹有效发射时机的监督机器学习方法：基于超视距空战仿真》

《通过小型无人机系统将情报能力“作战化”》

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

《通用大语言模型：无人机指挥与控制接口》最新40页

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

Arxiv

0+阅读 · 2023年7月28日

f-Divergence Minimization for Sequence-Level Knowledge Distillation

Arxiv

0+阅读 · 2023年7月27日

From Contextual Data to Newsvendor Decisions: On the Actual Performance of Data-Driven Algorithms

Arxiv

1+阅读 · 2023年7月27日

Simplified Concrete Dropout -- Improving the Generation of Attribution Masks for Fine-grained Classification

Arxiv

0+阅读 · 2023年7月27日

A Verified Efficient Implementation of the Weighted Path Order

Arxiv

0+阅读 · 2023年7月27日

Contrastive Domain Adaptation for Time-Series via Temporal Mixup

Arxiv

0+阅读 · 2023年7月27日

On the Generalization Effects of Linear Transformations in Data Augmentation

Arxiv

0+阅读 · 2023年7月26日

Memory-Efficient Graph Convolutional Networks for Object Classification and Detection with Event Cameras

Arxiv

0+阅读 · 2023年7月26日

Efficient Estimation of the Local Robustness of Machine Learning Models

Arxiv

0+阅读 · 2023年7月26日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

相关基金

空间分数阶质量守恒型Allen-Cahn方程的高效数值算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

随机辛算法和多辛算法

国家自然科学基金

2+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

负载型过渡金属碳化物催化剂的合成、表征及在染料敏化太阳能电池中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

随机泛函微分方程的适定性与渐近性分析

国家自然科学基金

0+阅读 · 2012年12月31日

不可靠通信环境下复杂动态网络状态估计与故障诊断

国家自然科学基金

0+阅读 · 2012年12月31日

WRKY类转录因子在托品烷类生物碱生物合成中的调控作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

多智能体不确定性系统的自适应一致性问题研究

国家自然科学基金

6+阅读 · 2012年12月31日

以带隙可调的Zn(O,S)梯度薄膜为缓层的CuInS2薄膜太阳能电池研究

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员