COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication

Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client clustering and knowledge distillation, simultaneously handling architectural and statistical heterogeneity remains difficult. We introduce COSMOS, a model-agnostic framework that enables server-side personalization using only pseudo-label communication. Clients train local models and predict on the public data; the server clusters clients by prediction similarity, trains a cluster-specific model for each group using its own compute, and distills the resulting models back to clients. We provide the first theoretical analysis showing that distillation from the learned cluster models can yield exponential personalization risk contraction, going beyond the convergence-to-stationarity guarantees typically provided in model-agnostic FL. Experiments across benchmarks demonstrate that COSMOS consistently outperforms all model-agnostic FL baselines while remaining competitive with state-of-the-art personalized FL methods. More broadly, our results highlight personalized server-side learning with pseudo-labels as a promising paradigm for scalable and model-agnostic federated learning in highly heterogeneous environments.

翻译：异构环境中的联邦学习仍具挑战性，因为客户端模型在架构和数据分布上常存在差异。尽管近期方法通过客户端聚类和知识蒸馏试图解决该问题，但同时处理架构异构性与统计异构性仍具难度。我们提出COSMOS——一个仅依赖伪标签通信实现服务器端个性化的模型无关框架。客户端训练本地模型并在公共数据上进行预测；服务器根据预测相似性对客户端聚类，利用自身算力为每个聚类训练特定模型，并将生成的模型蒸馏回客户端。我们首次提供理论分析，证明从学习到的聚类模型进行蒸馏可实现指数级个性化风险收缩，超越了模型无关联邦学习中常见的收敛至驻点保证。跨基准实验表明，COSMOS始终优于所有模型无关联邦学习基线，且与最先进的个性化联邦学习方法性能相当。更广泛而言，我们的结果揭示基于伪标签的个性化服务器端学习，是高度异构环境中实现可扩展且模型无关联邦学习的具有前景的范式。

相关内容

服务器

关注 14

服务器，也称伺服器，是提供计算服务的设备。由于服务器需要响应服务请求，并进行处理，因此一般来说服务器应具备承担服务并且保障服务的能力。
服务器的构成包括处理器、硬盘、内存、系统总线等，和通用的计算机架构类似，但是由于需要提供高可靠的服务，因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。

【CVPR Highlight 2026】 VPDR：驯服噪声诱导的原型退化，实现隐私保护个性化联邦微调

专知会员服务

11+阅读 · 5月2日

异构联邦学习在无人系统中的研究综述

专知会员服务

12+阅读 · 2025年5月25日

联邦长尾学习研究综述

专知会员服务

15+阅读 · 2025年5月1日

Meta-Transformer：多模态学习的统一框架

专知会员服务

59+阅读 · 2023年7月21日