Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

Federated Learning (FL) involves training a model over a dataset distributed among clients, with the constraint that each client's dataset is localized and possibly heterogeneous. In FL, small and noisy datasets are common, highlighting the need for well-calibrated models that represent the uncertainty of predictions. The closest FL techniques to achieving such goals are the Bayesian FL methods which collect parameter samples from local posteriors, and aggregate them to approximate the global posterior. To improve scalability for larger models, one common Bayesian approach is to approximate the global predictive posterior by multiplying local predictive posteriors. In this work, we demonstrate that this method gives systematically overconfident predictions, and we remedy this by proposing $\beta$-Predictive Bayes, a Bayesian FL algorithm that interpolates between a mixture and product of the predictive posteriors, using a tunable parameter $\beta$. This parameter is tuned to improve the global ensemble's calibration, before it is distilled to a single model. Our method is evaluated on a variety of regression and classification datasets to demonstrate its superiority in calibration to other baselines, even as data heterogeneity increases. Code available at https://github.com/hasanmohsin/betaPredBayes_FL

翻译：联邦学习（FL）涉及在分布在客户端的数据集上训练模型，且每个客户端的数据集具有本地化且可能异构的约束。在FL中，小规模且有噪声的数据集很常见，这凸显了对能够表示预测不确定性的良好校准模型的需求。最接近实现此类目标的FL技术是贝叶斯FL方法，这些方法从本地后验中收集参数样本，并聚合它们以近似全局后验。为了提高较大模型的可扩展性，一种常见的贝叶斯方法是通过相乘本地预测后验来近似全局预测后验。在本工作中，我们证明该方法会系统性地给出过度自信的预测，并通过提出$\beta$-预测贝叶斯（一种FL贝叶斯算法）来纠正这一问题，该算法通过可调参数$\beta$在预测后验的混合与乘积之间进行插值。该参数经过调优以改善全局集成的校准，随后将其蒸馏为单一模型。我们在多种回归和分类数据集上评估了该方法，证明即使数据异构性增加，其在校准方面也优于其他基线。代码见https://github.com/hasanmohsin/betaPredBayes_FL

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日