Where is the Testbed for my Federated Learning Research?

Progressing beyond centralized AI is of paramount importance, yet distributed AI solutions, in particular the various federated learning (FL) algorithms, are often not comprehensively assessed. This prevents the research community from identifying the most promising approaches and practitioners from being convinced that a given solution is deployment-ready. The largest hurdle to FL algorithm evaluation is the difficulty of conducting real-world experiments over a variety of FL client devices and platforms, with different datasets and data distributions, all while assessing multiple dimensions of algorithm performance, such as inference accuracy, energy consumption, and time to convergence. In this paper, we present CoLExT, a real-world testbed for FL research. CoLExT is designed to streamline experimentation with custom FL algorithms in a rich testbed configuration space, with a large number of heterogeneous edge devices ranging from single-board computers to smartphones, and provides real-time collection and visualization of a variety of metrics through automatic instrumentation. According to our evaluation, porting FL algorithms to CoLExT requires minimal involvement from the developer, and the instrumentation introduces minimal resource usage overhead. Furthermore, through an initial investigation involving popular FL algorithms running on CoLExT, we reveal previously unknown trade-offs, inefficiencies, and programming bugs.

Federated Learning

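Federated learning is an emerging foundational AI technique, first proposed by Google in 2016, originally to let Android phone users update models locally on their devices. Its design goal is to enable efficient machine learning across multiple participants or compute nodes while keeping exchanged data secure, protecting device-level and personal data privacy, and remaining legally compliant. The machine learning algorithms usable in federated learning are not limited to neural networks; they also include other important algorithms such as random forests. Federated learning is expected to become the foundation for the next generation of collaborative AI algorithms and collaborative networks.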
