评估多语言语言模型中的跨语言遗忘 (Evaluating Cross-Lingual Unlearning in Multilingual Language Models) - 专知论文

会员服务 ·

0

跨语言 · 子空间 · 结构 · 语言模型 · 算法 ·

Evaluating Cross-Lingual Unlearning in Multilingual Language Models

翻译：评估多语言语言模型中的跨语言遗忘

Tyler Lizzo,Larry Heck

We present the first comprehensive evaluation of cross-lingual unlearning in multilingual LLMs. Using translated TOFU benchmarks in seven language/script variants, we test major unlearning algorithms and show that most fail to remove facts outside the training language, even when utility remains high. However, subspace-projection consistently outperforms the other methods, achieving strong cross-lingual forgetting with minimal degradation. Analysis of learned task subspaces reveals a shared interlingua structure: removing this shared subspace harms all languages, while removing language-specific components selectively affects one. These results demonstrate that multilingual forgetting depends on geometry in weight space, motivating subspace-based approaches for future unlearning systems.

翻译：我们首次对多语言大语言模型中的跨语言遗忘进行了全面评估。通过使用七种语言/文字变体的翻译版TOFU基准，我们测试了主流遗忘算法，结果表明大多数算法无法移除训练语言之外的事实信息，即使模型实用性仍保持较高水平。然而，子空间投影方法始终优于其他方法，能以最小性能损失实现强效的跨语言遗忘。对已学习任务子空间的分析揭示了共享的中间语言结构：移除该共享子空间会损害所有语言性能，而移除语言特定组件则仅选择性影响单一语言。这些结果证明多语言遗忘依赖于权重空间的几何结构，为未来基于子空间的遗忘系统设计提供了理论依据。

0

相关内容

跨语言

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

UTC: 用于视觉对话的任务间对比学习的统一Transformer

UTC: 用于视觉对话的任务间对比学习的统一Transformer

专知会员服务

14+阅读 · 2022年5月4日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

【ACL2020-Facebook AI】大规模无监督跨语言表示学习

【ACL2020-Facebook AI】大规模无监督跨语言表示学习

专知会员服务

34+阅读 · 2020年4月5日

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

专知会员服务

14+阅读 · 2020年3月27日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【CVPR2020-牛津-谷歌】语音到动作:动作识别的跨模态监督，Cross-modal Supervision

【CVPR2020-牛津-谷歌】语音到动作:动作识别的跨模态监督，Cross-modal Supervision

专知

10+阅读 · 2020年3月31日

神经网络机器翻译原理：LSTM、seq2seq到Zero-Shot

神经网络机器翻译原理：LSTM、seq2seq到Zero-Shot

北京思腾合力科技有限公司

11+阅读 · 2017年8月10日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

复杂多元数据的半参数统计推断

国家自然科学基金

5+阅读 · 2014年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

PlaM: Training-Free Plateau-Guided Model Merging for Better Visual Grounding in MLLMs

Arxiv

0+阅读 · 1月12日

Interpretable Text Classification Applied to the Detection of LLM-generated Creative Writing

Arxiv

0+阅读 · 1月12日

Distributional Clarity: The Hidden Driver of RL-Friendliness in Large Language Models

Arxiv

0+阅读 · 1月11日

Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Arxiv

0+阅读 · 1月10日

One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection

Arxiv

0+阅读 · 1月9日

VIP会员

文章信息

相关主题

相关VIP内容

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

UTC: 用于视觉对话的任务间对比学习的统一Transformer

UTC: 用于视觉对话的任务间对比学习的统一Transformer

专知会员服务

14+阅读 · 2022年5月4日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

【ACL2020-Facebook AI】大规模无监督跨语言表示学习

【ACL2020-Facebook AI】大规模无监督跨语言表示学习

专知会员服务

34+阅读 · 2020年4月5日

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

专知会员服务

14+阅读 · 2020年3月27日

热门VIP内容

开通专知VIP会员享更多权益服务

《长远博弈：全球国防人工智能现状25国或地区案例研究》600页书籍

《空中濒海区域成功作战的新兴人工智能战术》42页最新报告

人工智能接管指挥：美国防务领域的自动化

《不确定环境下的移动任务规划研究》133页

相关资讯

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【CVPR2020-牛津-谷歌】语音到动作:动作识别的跨模态监督，Cross-modal Supervision

【CVPR2020-牛津-谷歌】语音到动作:动作识别的跨模态监督，Cross-modal Supervision

专知

10+阅读 · 2020年3月31日

神经网络机器翻译原理：LSTM、seq2seq到Zero-Shot

神经网络机器翻译原理：LSTM、seq2seq到Zero-Shot

北京思腾合力科技有限公司

11+阅读 · 2017年8月10日

相关论文

PlaM: Training-Free Plateau-Guided Model Merging for Better Visual Grounding in MLLMs

Arxiv

0+阅读 · 1月12日

Interpretable Text Classification Applied to the Detection of LLM-generated Creative Writing

Arxiv

0+阅读 · 1月12日

Distributional Clarity: The Hidden Driver of RL-Friendliness in Large Language Models

Arxiv

0+阅读 · 1月11日

Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Arxiv

0+阅读 · 1月10日

One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection

Arxiv

0+阅读 · 1月9日

相关基金

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

复杂多元数据的半参数统计推断

国家自然科学基金

5+阅读 · 2014年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

微信扫码咨询专知VIP会员