一个新颖的稀疏正则化方法 (A Novel Sparse Regularizer) - 专知论文

会员服务 ·

0

正则化 · 稀疏正则化 · 稀疏正则化方法 · 稀疏 · 并行化 ·

2023 年 4 月 13 日

A Novel Sparse Regularizer

翻译：一个新颖的稀疏正则化方法

Hovig Tigran Bayandorian

$L_{p}$-norm regularization schemes such as $L_{0}$, $L_{1}$, and $L_{2}$-norm regularization and $L_{p}$-norm-based regularization techniques such as weight decay and group LASSO compute a quantity which depends on model weights considered in isolation from one another. This paper describes a novel regularizer which is not based on an $L_{p}$-norm. In contrast with $L_{p}$-norm-based regularization, this regularizer is concerned with the spatial arrangement of weights within a weight matrix. This regularizer is an additive term for the loss function and is differentiable, simple and fast to compute, scale-invariant, requires a trivial amount of additional memory, and can easily be parallelized. Empirically this method yields approximately a one order-of-magnitude improvement in the number of nonzero model parameters at a given level of accuracy.

翻译：$L_{p}$-范数正则化方法，如$L_{0}$、$L_{1}$和$L_{2}$-范数正则化以及基于$L_{p}$-范数的正则化方法，例如权重衰减和组LASSO，都计算了与单独考虑的模型权重有关的量。本文描述了一种新颖的正则化方法，它不是基于$L_{p}$-范数的。与$L_{p}$-范数正则化方法不同，这种正则化方法关注的是权重矩阵内部权重的空间排列方式。这种正则化方法是损失函数的可添加项，并且可微分，简单易用、计算速度快，而且与比例无关，需要极少量的附加内存，并且容易并行化。经验上，该方法在达到相同精度的情况下，可使非零模型参数的数量提高约一个数量级。

0

相关内容

正则化

在数学，统计学和计算机科学中，尤其是在机器学习和逆问题中，正则化是添加信息以解决不适定问题或防止过度拟合的过程。正则化适用于不适定的优化问题中的目标函数。

【CVPR2022】端到端实时矢量边缘提取（E2EC）

【CVPR2022】端到端实时矢量边缘提取（E2EC）

专知会员服务

16+阅读 · 2022年4月14日

【Alex Nowak-Vila博士论文】有理论保证的结构化预测， Structured Prediction with Theoretical Guarantees

【Alex Nowak-Vila博士论文】有理论保证的结构化预测， Structured Prediction with Theoretical Guarantees

专知会员服务

13+阅读 · 2022年3月15日

【WWW2021】高效的非抽样知识图谱嵌入

专知会员服务

38+阅读 · 2021年4月25日

【TPAMI2021】鲁棒可微SVD，Robust Differentiable SVD

专知会员服务

23+阅读 · 2021年4月10日

【Aalto博士论文】高效样本近似贝叶斯计算的高斯过程代理方法，84页pdf

专知会员服务

35+阅读 · 2020年9月30日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

专知会员服务

34+阅读 · 2020年3月4日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

【论文推荐】最新八篇图像检索相关论文—三元组、深度特征图、判别式、卷积特征聚合、视觉-关系知识图谱、大规模图像检索

【论文推荐】最新八篇图像检索相关论文—三元组、深度特征图、判别式、卷积特征聚合、视觉-关系知识图谱、大规模图像检索

专知

33+阅读 · 2018年4月23日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于包间距离、直接以包为学习对象的多示例学习维数约减问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

医学图像分割的新变分模型及其快速有效的最优化算法

国家自然科学基金

0+阅读 · 2013年12月31日

稀有变异的有效发现与识别

国家自然科学基金

0+阅读 · 2013年12月31日

柑橘黄龙病亚洲种病原( Cadidatus Liberibacter assiaticus)重组抗体的研究

国家自然科学基金

0+阅读 · 2012年12月31日

拟Frobenius-Lusztig核

国家自然科学基金

0+阅读 · 2012年12月31日

汉滩病毒活化TLR4-TRAF6-SFK信号通路致血管内皮细胞通透性升高的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

奇异积分算子及其交换子的理论与应用

国家自然科学基金

0+阅读 · 2010年12月31日

基于Sparse-Land模型的SAR图像噪声抑制与分割

国家自然科学基金

0+阅读 · 2009年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

Lasso in infinite dimension: application to variable selection in functional multivariate linear regression

Arxiv

0+阅读 · 2023年5月31日

On the Approximability of External-Influence-Driven Problems

Arxiv

0+阅读 · 2023年5月30日

PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction

Arxiv

0+阅读 · 2023年5月30日

GCondNet: A Novel Method for Improving Neural Networks on Small High-Dimensional Tabular Data

Arxiv

0+阅读 · 2023年5月29日

Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions

Arxiv

0+阅读 · 2023年5月28日

Statistical Inference in High-Dimensional Generalized Linear Models with Asymmetric Link Functions

Arxiv

0+阅读 · 2023年5月28日

Unconstrained Dynamic Regret via Sparse Coding

Arxiv

0+阅读 · 2023年5月27日

Multi-Fidelity Covariance Estimation in the Log-Euclidean Geometry

Arxiv

0+阅读 · 2023年5月26日

Automatic sparse PCA for high-dimensional data

Automatic sparse PCA for high-dimensional data

Arxiv

0+阅读 · 2023年5月26日

Addressing bias in online selection with limited budget of comparisons

Arxiv

0+阅读 · 2023年5月26日

VIP会员

文章信息

相关主题

稀疏正则化

稀疏正则化方法

相关VIP内容

【CVPR2022】端到端实时矢量边缘提取（E2EC）

【CVPR2022】端到端实时矢量边缘提取（E2EC）

专知会员服务

16+阅读 · 2022年4月14日

【Alex Nowak-Vila博士论文】有理论保证的结构化预测， Structured Prediction with Theoretical Guarantees

【Alex Nowak-Vila博士论文】有理论保证的结构化预测， Structured Prediction with Theoretical Guarantees

专知会员服务

13+阅读 · 2022年3月15日

【WWW2021】高效的非抽样知识图谱嵌入

专知会员服务

38+阅读 · 2021年4月25日

【TPAMI2021】鲁棒可微SVD，Robust Differentiable SVD

专知会员服务

23+阅读 · 2021年4月10日

【Aalto博士论文】高效样本近似贝叶斯计算的高斯过程代理方法，84页pdf

专知会员服务

35+阅读 · 2020年9月30日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

专知会员服务

34+阅读 · 2020年3月4日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

论学习、公平性与复杂度

《整合杀伤链：一个用于边缘目标验证与战术推理的零样本框架》最新资料

2025中国人工智能学会系列白皮书⸺棋盘上的人工智能|附下载

通用智能体评估的逻辑架构

相关资讯

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

【论文推荐】最新八篇图像检索相关论文—三元组、深度特征图、判别式、卷积特征聚合、视觉-关系知识图谱、大规模图像检索

【论文推荐】最新八篇图像检索相关论文—三元组、深度特征图、判别式、卷积特征聚合、视觉-关系知识图谱、大规模图像检索

专知

33+阅读 · 2018年4月23日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Lasso in infinite dimension: application to variable selection in functional multivariate linear regression

Arxiv

0+阅读 · 2023年5月31日

On the Approximability of External-Influence-Driven Problems

Arxiv

0+阅读 · 2023年5月30日

PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction

Arxiv

0+阅读 · 2023年5月30日

GCondNet: A Novel Method for Improving Neural Networks on Small High-Dimensional Tabular Data

Arxiv

0+阅读 · 2023年5月29日

Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions

Arxiv

0+阅读 · 2023年5月28日

Statistical Inference in High-Dimensional Generalized Linear Models with Asymmetric Link Functions

Arxiv

0+阅读 · 2023年5月28日

Unconstrained Dynamic Regret via Sparse Coding

Arxiv

0+阅读 · 2023年5月27日

Multi-Fidelity Covariance Estimation in the Log-Euclidean Geometry

Arxiv

0+阅读 · 2023年5月26日

Automatic sparse PCA for high-dimensional data

Automatic sparse PCA for high-dimensional data

Arxiv

0+阅读 · 2023年5月26日

Addressing bias in online selection with limited budget of comparisons

Arxiv

0+阅读 · 2023年5月26日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于包间距离、直接以包为学习对象的多示例学习维数约减问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

医学图像分割的新变分模型及其快速有效的最优化算法

国家自然科学基金

0+阅读 · 2013年12月31日

稀有变异的有效发现与识别

国家自然科学基金

0+阅读 · 2013年12月31日

柑橘黄龙病亚洲种病原( Cadidatus Liberibacter assiaticus)重组抗体的研究

国家自然科学基金

0+阅读 · 2012年12月31日

拟Frobenius-Lusztig核

国家自然科学基金

0+阅读 · 2012年12月31日

汉滩病毒活化TLR4-TRAF6-SFK信号通路致血管内皮细胞通透性升高的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

奇异积分算子及其交换子的理论与应用

国家自然科学基金

0+阅读 · 2010年12月31日

基于Sparse-Land模型的SAR图像噪声抑制与分割

国家自然科学基金

0+阅读 · 2009年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员