Federated Recommendation Systems (FRS) enable privacy-preserving model training by keeping user data on edge devices. However, deploying FRS in practical edge-cloud environments is difficult because of both system heterogeneity and statistical heterogeneity. Existing FRS participant selection strategies struggle to balance model convergence speed against recommendation quality as conditions in these volatile environments change. To address this, we formulate FRS participant selection in terms of a normalized utility cost that captures both model quality and system efficiency. We then propose a dynamic participant selection framework for multimodal FRS that incorporates a Multi-Armed Bandit (MAB)-based solver. We design a client-utility function that jointly evaluates historical Client Performance Reputation, data quality, and real-time system latency. By leveraging an Upper Confidence Bound (UCB) strategy, the framework balances exploration of under-sampled clients with exploitation of high-performing ones. We validate the approach on a realistic edge-cloud testbed using a multimodal movie-recommendation task. Experimental results show that our MAB-driven approach outperforms the baselines across eight data-skew scenarios. Specifically, it improves training efficiency by 32-50% while improving model quality metrics such as Recall@50 by up to roughly 5%.
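To make the selection mechanism concrete, the following is a minimal sketch of UCB-style participant selection. It is not the authors' implementation: the abstract does not give the exact utility function or UCB variant, so the weighted-sum `utility`, its `weights`, the `max_latency` normalization, and the names `ClientStats`, `ucb_select`, and `update` are all illustrative assumptions, and the scoring rule is the textbook UCB1 bonus applied to a top-k selection.

```python
import math
from dataclasses import dataclass


@dataclass
class ClientStats:
    pulls: int = 0             # number of rounds this client has been selected
    mean_utility: float = 0.0  # running mean of the utility observed for it


def utility(reputation, data_quality, latency, max_latency,
            weights=(0.4, 0.4, 0.2)):
    """Hypothetical client utility in [0, 1] (assumed form, not from the paper).

    reputation and data_quality are assumed to be pre-normalized to [0, 1];
    latency is mapped to a score where lower latency is better.
    """
    w_r, w_q, w_l = weights
    latency_score = 1.0 - min(latency / max_latency, 1.0)
    return w_r * reputation + w_q * data_quality + w_l * latency_score


def ucb_select(stats, k, round_idx, c=1.0):
    """Return the ids of the k clients with the highest UCB1 score.

    stats maps client id -> ClientStats; round_idx is the 1-based round number.
    """
    def score(cid):
        s = stats[cid]
        if s.pulls == 0:
            return math.inf  # never-sampled clients are explored first
        bonus = c * math.sqrt(2.0 * math.log(round_idx) / s.pulls)
        return s.mean_utility + bonus  # exploitation term + exploration bonus

    return sorted(stats, key=score, reverse=True)[:k]


def update(stats, cid, observed_utility):
    """Fold a newly observed utility into the client's running mean."""
    s = stats[cid]
    s.pulls += 1
    s.mean_utility += (observed_utility - s.mean_utility) / s.pulls
```

In each training round, a server following this sketch would call `ucb_select` to choose participants, run local training on them, compute `utility` from the measured reputation, data quality, and latency, and pass the result to `update`. Clients that have been sampled rarely keep a large exploration bonus, so they are revisited even if their early observations were poor.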