Comparing Linear Probes with Mahalanobis Cosine Similarity


Zhuofan Josh Ying, Peter Hase, Nikolaus Kriegeskorte

From arXiv, 16 pages, 10 figures

Linear probes are widely used in interpretability research and are often compared by cosine similarity. The Mahalanobis cosine similarity (MCS) between two directions, which reweights the inner product by the test-data covariance, is a natural task-aware refinement. Ying et al. (2026) report that a probe's MCS to a reference probe trained on out-of-distribution (OOD) data predicts the probe's OOD AUROC near-perfectly with a linear fit (R^2 = 0.98). Here, we extend this empirical finding across models, layers, and concept domains, and prove the general phenomenon in closed form: for balanced classes whose projections are Gaussian, OOD AUROC and MCS to the reference probe are approximately linearly related, because both are sigmoid-shaped functions of the probe's signal-to-noise ratio (SNR) on the test data. The theory also predicts when this linearity fails, which we verify empirically. MCS offers a theoretically grounded and empirically effective alternative to Euclidean cosine similarity for comparing linear probes.
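As a rough illustration of the quantities named in the abstract, the sketch below computes an MCS between two probe directions. It assumes the weighting matrix is the covariance of the test data itself, so that the inner product is w1^T Σ w2; under that assumption MCS equals the Pearson correlation between the two probes' projections of the test data. The paper's exact definition may differ. The function names, the synthetic data, and the SNR convention d' = (μ1 − μ0)/σ are all hypothetical choices made here for illustration; the only standard fact used is that two equal-variance Gaussian score distributions give AUROC = Φ(d'/√2).

```python
import math
import numpy as np


def mahalanobis_cosine(w1, w2, cov):
    """Cosine similarity under the inner product <u, v> = u^T cov v.

    Assumption: `cov` is the covariance of the test data, as the
    abstract's wording suggests; this is not taken from the paper's code.
    """
    w1 = np.asarray(w1, dtype=float)
    w2 = np.asarray(w2, dtype=float)
    inner = w1 @ cov @ w2
    norm1 = math.sqrt(w1 @ cov @ w1)
    norm2 = math.sqrt(w2 @ cov @ w2)
    return float(inner / (norm1 * norm2))


def euclidean_cosine(w1, w2):
    """Ordinary cosine similarity, i.e. the special case cov = identity."""
    w1 = np.asarray(w1, dtype=float)
    w2 = np.asarray(w2, dtype=float)
    return float(w1 @ w2 / (np.linalg.norm(w1) * np.linalg.norm(w2)))


def gaussian_auroc(d_prime):
    """AUROC for two equal-variance Gaussian score distributions.

    d_prime is the mean separation divided by the common standard
    deviation (an assumed SNR convention); AUROC = Phi(d' / sqrt(2)),
    and Phi(x) = 0.5 * (1 + erf(x / sqrt(2))).
    """
    return 0.5 * (1.0 + math.erf(d_prime / 2.0))


# Synthetic stand-in for test activations with correlated features.
rng = np.random.default_rng(0)
d = 5
mixing = rng.normal(size=(d, d))
X_test = rng.normal(size=(2000, d)) @ mixing
cov = np.cov(X_test, rowvar=False)

# Two hypothetical probe directions: the probe under study and a reference.
w_probe = rng.normal(size=d)
w_ref = rng.normal(size=d)

mcs = mahalanobis_cosine(w_probe, w_ref, cov)
ecs = euclidean_cosine(w_probe, w_ref)

# With the sample covariance as the weighting matrix, MCS coincides with
# the correlation between the two probes' projections of the test data.
proj_corr = float(np.corrcoef(X_test @ w_probe, X_test @ w_ref)[0, 1])

print(f"Euclidean cosine: {ecs:.3f}")
print(f"MCS:              {mcs:.3f}")
print(f"Projection corr.: {proj_corr:.3f}")
```

Under these assumptions the two similarity scores generally differ whenever the test features are correlated or unequally scaled, which is the sense in which MCS is task-aware: it measures agreement between the probes' outputs on the test data rather than the angle between their weight vectors.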
