Effects of sparsity and superposition on loss in simple autoencoders - 专知论文

会员服务 ·

0

特化 · 损失 · SimPLe · 自编码器 · Neural Networks ·

Effects of sparsity and superposition on loss in simple autoencoders

翻译：暂无翻译

Mriganka Basu Roy Chowdhury,Eric McLaughlin Weiner

from arxiv, 16 pages, 3 figures

One of the major difficulties in the mechanistic interpretability of neural networks is the occurrence of polysemanticity, which suggests that each neuron is typically responsible for multiple different tasks, impeding a clean interpretation of their function. The seminal paper of Elhage et al. (2022) argues that this occurs due to superposition, a phenomenon where the neural network represents distinct features as non-orthogonal directions in a lower-dimensional space, a strategy that allows much greater compression of the data without sacrificing fidelity due to the feature sparsity of input vectors. Elhage et al. (2022) empirically validates these hypotheses in a rather natural and simple autoencoder with sparse inputs. The contribution of the present work is to analyze the mathematical basis for the occurrence and optimality of superposition, while rigorously corroborating some of their findings. In particular, we provide upper and lower bounds for the L2 reconstruction loss, tight in the very sparse regime, for power activation functions. A short list of interesting open problems are also included at the end.

翻译：暂无翻译

0

相关内容

【斯坦福博士论文】时序平滑性假设下的深度神经网络自适应与正则化方法

【斯坦福博士论文】时序平滑性假设下的深度神经网络自适应与正则化方法

专知会员服务

15+阅读 · 2025年3月25日

[ICML2022] NeuroFluid: 流体仿真的人工智能新范式

[ICML2022] NeuroFluid: 流体仿真的人工智能新范式

专知会员服务

28+阅读 · 2022年6月8日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【CVPR 2022】面向无噪声对象轮廓的弱监督语义分割，Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation

【CVPR 2022】面向无噪声对象轮廓的弱监督语义分割，Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation

专知会员服务

10+阅读 · 2022年3月12日

【WWW2021】用优化框架解释和统一图神经网络

【WWW2021】用优化框架解释和统一图神经网络

专知会员服务

45+阅读 · 2021年2月1日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

专知会员服务

44+阅读 · 2020年1月10日

【机器学习论文推荐】EfficientNet:卷积神经网络的再思考模型缩放（EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks）

【机器学习论文推荐】EfficientNet:卷积神经网络的再思考模型缩放（EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks）

专知会员服务

17+阅读 · 2019年12月24日

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

专知会员服务

13+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CVPR2021】面向通用领域自适应的领域共识聚类

【CVPR2021】面向通用领域自适应的领域共识聚类

专知

24+阅读 · 2021年5月6日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

谷歌开源新模型EfficientNet：图像识别效率提升10倍，参数减少88%

谷歌开源新模型EfficientNet：图像识别效率提升10倍，参数减少88%

AI前线

15+阅读 · 2019年6月9日

神经网络图的简介（基本概念，DeepWalk以及GraphSage算法）

神经网络图的简介（基本概念，DeepWalk以及GraphSage算法）

AI研习社

12+阅读 · 2019年3月5日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

开放知识图谱

14+阅读 · 2018年4月3日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

CNN、RNN在自动特征提取中的应用

CNN、RNN在自动特征提取中的应用

乌镇智库

14+阅读 · 2017年8月4日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

多重网络中的级联与传播过程研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向空间自组网的低功耗理论与技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

InP基单片集成少模光发射芯片的研究

国家自然科学基金

0+阅读 · 2015年12月31日

复杂非完整多自主体网络协同算法设计与性能极限分析

国家自然科学基金

1+阅读 · 2015年12月31日

基于自适应采样和变复杂度近似的多学科稳健性设计优化方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

支持资源自适配接入的物联网服务提供方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

可与MPSoC高度融合的片上自主测试-自主修复关键技术研究：针对自然、人为可靠性威胁

国家自然科学基金

0+阅读 · 2015年12月31日

三维片上网络通信自适应容错方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于网络的情感语义词典的自动构建技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于交通行为的道路网络脆弱性识别及改善策略研究

国家自然科学基金

0+阅读 · 2014年12月31日

A Simplex Witness Certificate and Escape Force for Constant Collapse in Variational Autoencoders

Arxiv

0+阅读 · 6月23日

Toward Multi-Domain and Long-Tailed Quantization via Feature Alignment and Scaling

Arxiv

0+阅读 · 6月21日

Spatially Grounded Concept-Based Image Classification

Arxiv

0+阅读 · 6月19日

Simultaneously Efficient Allocation of Indivisible Items Across Multiple Dimensions

Arxiv

0+阅读 · 6月19日

A Categorial and Sheaf-Theoretic Semantics for Autonomic Component Ensembles

Arxiv

0+阅读 · 6月17日

An Optimization Framework for Automated Assessment of Biological Plausibility of Spiking Neurons

Arxiv

0+阅读 · 6月16日

Natural Language Descriptions of Deep Visual Features

Arxiv

12+阅读 · 2022年1月26日

An Introduction to Autoencoders

Arxiv

17+阅读 · 2022年1月11日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

VIP会员

文章信息

相关主题

Neural Networks

最新内容

ICML 2026 | 自回归Boltzmann生成器重塑分子采样

ICML 2026 | 自回归Boltzmann生成器重塑分子采样

专知会员服务

0+阅读 · 今天15:55

GNN跨域综述：从消息传递到图基础模型

GNN跨域综述：从消息传递到图基础模型

专知会员服务

0+阅读 · 今天15:53

无人机自主控制与人工智能：系统性综述

无人机自主控制与人工智能：系统性综述

专知会员服务

11+阅读 · 今天7:25

巡飞弹与反无人机系统——现代战场的两大支柱

巡飞弹与反无人机系统——现代战场的两大支柱

专知会员服务

3+阅读 · 今天6:54

《打造“黄金舰队”》57页报告

《打造“黄金舰队”》57页报告

专知会员服务

3+阅读 · 今天6:52

《北约数字教官网络发展路径》128页报告

《北约数字教官网络发展路径》128页报告

专知会员服务

2+阅读 · 今天6:33

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

专知会员服务

7+阅读 · 6月25日

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

专知会员服务

6+阅读 · 6月25日

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

专知会员服务

10+阅读 · 6月25日

网状网络及其在军事领域的运用

网状网络及其在军事领域的运用

专知会员服务

8+阅读 · 6月25日

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

专知会员服务

8+阅读 · 6月25日

无美国参与的欧洲战争方式（万字长文）

无美国参与的欧洲战争方式（万字长文）

专知会员服务

8+阅读 · 6月25日

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

专知会员服务

10+阅读 · 6月25日

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

专知会员服务

9+阅读 · 6月25日

《国防领域敏感性分析白皮书》

《国防领域敏感性分析白皮书》

专知会员服务

9+阅读 · 6月25日

相关VIP内容

【斯坦福博士论文】时序平滑性假设下的深度神经网络自适应与正则化方法

【斯坦福博士论文】时序平滑性假设下的深度神经网络自适应与正则化方法

专知会员服务

15+阅读 · 2025年3月25日

[ICML2022] NeuroFluid: 流体仿真的人工智能新范式

[ICML2022] NeuroFluid: 流体仿真的人工智能新范式

专知会员服务

28+阅读 · 2022年6月8日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【CVPR 2022】面向无噪声对象轮廓的弱监督语义分割，Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation

【CVPR 2022】面向无噪声对象轮廓的弱监督语义分割，Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation

专知会员服务

10+阅读 · 2022年3月12日

【WWW2021】用优化框架解释和统一图神经网络

【WWW2021】用优化框架解释和统一图神经网络

专知会员服务

45+阅读 · 2021年2月1日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

专知会员服务

44+阅读 · 2020年1月10日

【机器学习论文推荐】EfficientNet:卷积神经网络的再思考模型缩放（EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks）

【机器学习论文推荐】EfficientNet:卷积神经网络的再思考模型缩放（EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks）

专知会员服务

17+阅读 · 2019年12月24日

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

【Nature论文】用于理解图像分类决策和改进神经网络鲁棒性的对抗性解释（Adversarial Explanations for Understanding Image Classiﬁcation Decisions and Improved Neural Network Robustness ）

专知会员服务

13+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

GNN跨域综述：从消息传递到图基础模型

巡飞弹与反无人机系统——现代战场的两大支柱

ICML 2026 | 自回归Boltzmann生成器重塑分子采样

无人机自主控制与人工智能：系统性综述

相关资讯

【CVPR2021】面向通用领域自适应的领域共识聚类

【CVPR2021】面向通用领域自适应的领域共识聚类

专知

24+阅读 · 2021年5月6日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

谷歌开源新模型EfficientNet：图像识别效率提升10倍，参数减少88%

谷歌开源新模型EfficientNet：图像识别效率提升10倍，参数减少88%

AI前线

15+阅读 · 2019年6月9日

神经网络图的简介（基本概念，DeepWalk以及GraphSage算法）

神经网络图的简介（基本概念，DeepWalk以及GraphSage算法）

AI研习社

12+阅读 · 2019年3月5日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

开放知识图谱

14+阅读 · 2018年4月3日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

CNN、RNN在自动特征提取中的应用

CNN、RNN在自动特征提取中的应用

乌镇智库

14+阅读 · 2017年8月4日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

相关论文

A Simplex Witness Certificate and Escape Force for Constant Collapse in Variational Autoencoders

Arxiv

0+阅读 · 6月23日

Toward Multi-Domain and Long-Tailed Quantization via Feature Alignment and Scaling

Arxiv

0+阅读 · 6月21日

Spatially Grounded Concept-Based Image Classification

Arxiv

0+阅读 · 6月19日

Simultaneously Efficient Allocation of Indivisible Items Across Multiple Dimensions

Arxiv

0+阅读 · 6月19日

A Categorial and Sheaf-Theoretic Semantics for Autonomic Component Ensembles

Arxiv

0+阅读 · 6月17日

An Optimization Framework for Automated Assessment of Biological Plausibility of Spiking Neurons

Arxiv

0+阅读 · 6月16日

Natural Language Descriptions of Deep Visual Features

Arxiv

12+阅读 · 2022年1月26日

An Introduction to Autoencoders

Arxiv

17+阅读 · 2022年1月11日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

相关基金

多重网络中的级联与传播过程研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向空间自组网的低功耗理论与技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

InP基单片集成少模光发射芯片的研究

国家自然科学基金

0+阅读 · 2015年12月31日

复杂非完整多自主体网络协同算法设计与性能极限分析

国家自然科学基金

1+阅读 · 2015年12月31日

基于自适应采样和变复杂度近似的多学科稳健性设计优化方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

支持资源自适配接入的物联网服务提供方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

可与MPSoC高度融合的片上自主测试-自主修复关键技术研究：针对自然、人为可靠性威胁

国家自然科学基金

0+阅读 · 2015年12月31日

三维片上网络通信自适应容错方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于网络的情感语义词典的自动构建技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于交通行为的道路网络脆弱性识别及改善策略研究

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员