Quantifying the Impact of Label Noise on Federated Learning - 专知论文

会员服务 ·

0

噪声 · 全局模型 · 联邦学习 · 收敛速度 · 理论分析 ·

2023 年 4 月 3 日

Quantifying the Impact of Label Noise on Federated Learning

翻译：标签噪声对联邦学习影响的量化研究

Shuqi Ke,Chao Huang,Xin Liu

from arxiv, Accepted by The AAAI 2023 Workshop on Representation Learning for Responsible Human-Centric AI

Federated Learning (FL) is a distributed machine learning paradigm where clients collaboratively train a model using their local (human-generated) datasets. While existing studies focus on FL algorithm development to tackle data heterogeneity across clients, the important issue of data quality (e.g., label noise) in FL is overlooked. This paper aims to fill this gap by providing a quantitative study on the impact of label noise on FL. We derive an upper bound for the generalization error that is linear in the clients' label noise level. Then we conduct experiments on MNIST and CIFAR-10 datasets using various FL algorithms. Our empirical results show that the global model accuracy linearly decreases as the noise level increases, which is consistent with our theoretical analysis. We further find that label noise slows down the convergence of FL training, and the global model tends to overfit when the noise level is high.

翻译：联邦学习（Federated Learning, FL）是一种分布式机器学习范式，客户端利用本地的（人类生成的）数据集协作训练模型。现有研究主要聚焦于应对客户端间数据异质性的联邦学习算法开发，但数据质量（例如标签噪声）这一重要问题在联邦学习中被忽视了。本文旨在弥补这一空白，通过量化研究标签噪声对联邦学习的影响。我们推导出一个泛化误差的上界，该上界与客户端的标签噪声水平呈线性关系。随后，我们使用多种联邦学习算法在MNIST和CIFAR-10数据集上进行了实验。实验结果与理论分析一致，表明全局模型准确率随噪声水平升高而线性下降。我们进一步发现，标签噪声会减缓联邦学习的训练收敛速度，并且在噪声水平较高时，全局模型倾向于过拟合。

0

相关内容

「联邦学习模型安全与隐私」研究进展

「联邦学习模型安全与隐私」研究进展

专知会员服务

69+阅读 · 2022年9月24日

【KDD22】DICE: 域攻击不变的因果学习以保护数据隐私、提升攻击迁移性和对抗鲁棒性

【KDD22】DICE: 域攻击不变的因果学习以保护数据隐私、提升攻击迁移性和对抗鲁棒性

专知会员服务

12+阅读 · 2022年8月27日

【CVPR2021】用随机标签的神经架构搜索

专知会员服务

12+阅读 · 2021年3月21日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

92+阅读 · 2020年12月2日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

专知会员服务

81+阅读 · 2020年3月4日

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

专知会员服务

89+阅读 · 2020年2月28日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

PaperWeekly

0+阅读 · 2022年10月23日

最新《联邦学习Federated Learning》报告，47页ppt

最新《联邦学习Federated Learning》报告，47页ppt

专知

48+阅读 · 2020年12月2日

模型攻击：鲁棒性联邦学习研究的最新进展

模型攻击：鲁棒性联邦学习研究的最新进展

机器之心

35+阅读 · 2020年6月3日

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

专知

20+阅读 · 2020年2月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

基于高光谱数据的交叉定标光谱特性差异订正

国家自然科学基金

0+阅读 · 2013年12月31日

信息与能量同时传输的多用户系统能效理论及优化方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于量子点填充的光子晶体光纤多参量荧光温度传感器研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向物联网的无源UHF RFID系统传播模型及识别范围预测研究

国家自然科学基金

0+阅读 · 2012年12月31日

路易斯碱催化的贫电子烯（炔）烃环加成反应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

高温质子交换膜燃料电池性能衰减机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于网络编码的无线网状网路由技术研究

国家自然科学基金

0+阅读 · 2010年12月31日

基于网络编码的大规模无线网络容量分析

国家自然科学基金

1+阅读 · 2009年12月31日

Theoretically Principled Federated Learning for Balancing Privacy and Utility

Arxiv

0+阅读 · 2023年5月24日

Federated Variational Inference: Towards Improved Personalization and Generalization

Arxiv

0+阅读 · 2023年5月23日

Federated Transfer-Ordered-Personalized Learning for Driver Monitoring Application

Arxiv

0+阅读 · 2023年5月22日

On the Fairness Impacts of Private Ensembles Models

Arxiv

0+阅读 · 2023年5月19日

Towards the Practical Utility of Federated Learning in the Medical Domain

Arxiv

0+阅读 · 2023年5月19日

A Survey of Federated Evaluation in Federated Learning

Arxiv

0+阅读 · 2023年5月19日

Semi-verified PAC Learning from the Crowd

Arxiv

0+阅读 · 2023年5月18日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Arxiv

12+阅读 · 2021年6月9日

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

Arxiv

12+阅读 · 2021年2月21日

VIP会员

文章信息

相关主题

最新内容

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

专知会员服务

1+阅读 · 今天2:06

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

专知会员服务

1+阅读 · 今天1:37

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

专知会员服务

3+阅读 · 6月17日

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

专知会员服务

3+阅读 · 6月17日

学习数据的几何：形状空间分析数学综述

学习数据的几何：形状空间分析数学综述

专知会员服务

4+阅读 · 6月17日

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

专知会员服务

6+阅读 · 6月17日

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

6+阅读 · 6月17日

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

3+阅读 · 6月17日

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

4+阅读 · 6月17日

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

4+阅读 · 6月17日

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

专知会员服务

4+阅读 · 6月17日

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

专知会员服务

3+阅读 · 6月17日

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

专知会员服务

6+阅读 · 6月16日

多模态代码智能综述：从视觉输入到可执行代码系统

多模态代码智能综述：从视觉输入到可执行代码系统

专知会员服务

8+阅读 · 6月16日

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

专知会员服务

6+阅读 · 6月16日

相关VIP内容

「联邦学习模型安全与隐私」研究进展

「联邦学习模型安全与隐私」研究进展

专知会员服务

69+阅读 · 2022年9月24日

【KDD22】DICE: 域攻击不变的因果学习以保护数据隐私、提升攻击迁移性和对抗鲁棒性

【KDD22】DICE: 域攻击不变的因果学习以保护数据隐私、提升攻击迁移性和对抗鲁棒性

专知会员服务

12+阅读 · 2022年8月27日

【CVPR2021】用随机标签的神经架构搜索

专知会员服务

12+阅读 · 2021年3月21日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

92+阅读 · 2020年12月2日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

专知会员服务

81+阅读 · 2020年3月4日

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

专知会员服务

89+阅读 · 2020年2月28日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

相关资讯

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

PaperWeekly

0+阅读 · 2022年10月23日

最新《联邦学习Federated Learning》报告，47页ppt

最新《联邦学习Federated Learning》报告，47页ppt

专知

48+阅读 · 2020年12月2日

模型攻击：鲁棒性联邦学习研究的最新进展

模型攻击：鲁棒性联邦学习研究的最新进展

机器之心

35+阅读 · 2020年6月3日

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

【香港科技大学】联邦半监督学习综述，A Survey on Federated Semi-supervised Learning

专知

20+阅读 · 2020年2月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Theoretically Principled Federated Learning for Balancing Privacy and Utility

Arxiv

0+阅读 · 2023年5月24日

Federated Variational Inference: Towards Improved Personalization and Generalization

Arxiv

0+阅读 · 2023年5月23日

Federated Transfer-Ordered-Personalized Learning for Driver Monitoring Application

Arxiv

0+阅读 · 2023年5月22日

On the Fairness Impacts of Private Ensembles Models

Arxiv

0+阅读 · 2023年5月19日

Towards the Practical Utility of Federated Learning in the Medical Domain

Arxiv

0+阅读 · 2023年5月19日

A Survey of Federated Evaluation in Federated Learning

Arxiv

0+阅读 · 2023年5月19日

Semi-verified PAC Learning from the Crowd

Arxiv

0+阅读 · 2023年5月18日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Arxiv

12+阅读 · 2021年6月9日

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

Arxiv

12+阅读 · 2021年2月21日

相关基金

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

基于高光谱数据的交叉定标光谱特性差异订正

国家自然科学基金

0+阅读 · 2013年12月31日

信息与能量同时传输的多用户系统能效理论及优化方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于量子点填充的光子晶体光纤多参量荧光温度传感器研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向物联网的无源UHF RFID系统传播模型及识别范围预测研究

国家自然科学基金

0+阅读 · 2012年12月31日

路易斯碱催化的贫电子烯（炔）烃环加成反应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

高温质子交换膜燃料电池性能衰减机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于网络编码的无线网状网路由技术研究

国家自然科学基金

0+阅读 · 2010年12月31日

基于网络编码的大规模无线网络容量分析

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员