Federated Learning (FL) enables collaborative model training across distributed clients without sharing raw data, thereby preserving privacy. However, FL often suffers from significant communication and computational overhead, limiting its scalability and sustainability. In this work, we introduce a Full Compression Pipeline (FCP) for FL in communication-constrained environments. FCP integrates three complementary deep compression techniques (pruning, quantization, and Huffman encoding) into a unified end-to-end framework. By compressing local models and communication payloads, FCP substantially reduces transmission costs and resource consumption while maintaining competitive accuracy. To quantify its impact, we develop an evaluation framework that captures both communication and computation overheads as a unified model cost, allowing a holistic assessment of efficiency trade-offs. The pipeline is evaluated in an independent and identically distributed (IID) and non-IID data setting. In one representative scenario, training a ResNet-12 model on the CIFAR-10 dataset with ten clients and a 2 Mbps bandwidth, the FCP achieves more than 11$\times$ reduction in model size, with only a 2% drop in accuracy compared to the uncompressed baseline. This results in an FL training that is more than 60% faster.
翻译:联邦学习(FL)能够在分布式客户端间实现协作模型训练,无需共享原始数据,从而保护隐私。然而,FL常因显著的通信和计算开销而受限,影响其可扩展性与可持续性。本文针对通信受限环境,提出一种全压缩流水线(FCP)。FCP将三种互补的深度压缩技术(剪枝、量化和霍夫曼编码)整合为统一的端到端框架。通过压缩本地模型与通信负载,FCP在保持竞争性精度的同时,大幅降低传输成本与资源消耗。为量化其影响,我们构建了一个评估框架,将通信与计算开销统一为模型成本,从而全面权衡效率。该流水线在独立同分布(IID)与非独立同分布(non-IID)数据设置下进行评估。在一个典型场景中(基于CIFAR-10数据集,使用10个客户端和2 Mbps带宽训练ResNet-12模型),FCP实现了超过11倍的模型尺寸缩减,且相对于未压缩基线仅损失2%的精度。这使得FL训练速度提升超过60%。