Federated Nearest Neighbor Machine Translation

To protect user privacy and meet legal regulations, federated learning (FL) is attracting significant attention. Training neural machine translation (NMT) models with traditional FL algorithm (e.g., FedAvg) typically relies on multi-round model-based interactions. However, it is impractical and inefficient for machine translation tasks due to the vast communication overheads and heavy synchronization. In this paper, we propose a novel federated nearest neighbor (FedNN) machine translation framework that, instead of multi-round model-based interactions, leverages one-round memorization-based interaction to share knowledge across different clients to build low-overhead privacy-preserving systems. The whole approach equips the public NMT model trained on large-scale accessible data with a $k$-nearest-neighbor ($$kNN) classifier and integrates the external datastore constructed by private text data in all clients to form the final FL model. A two-phase datastore encryption strategy is introduced to achieve privacy-preserving during this process. Extensive experiments show that FedNN significantly reduces computational and communication costs compared with FedAvg, while maintaining promising performance in different FL settings.

翻译：为保护用户隐私并满足法律法规要求，联邦学习（FL）正受到广泛关注。采用传统联邦学习算法（如FedAvg）训练神经机器翻译（NMT）模型通常依赖于多轮基于模型的交互。然而，由于巨大的通信开销和繁重的同步负担，这种方法在机器翻译任务中既不实际也不高效。本文提出了一种新颖的联邦最近邻（FedNN）机器翻译框架，该框架摒弃了多轮基于模型的交互，转而利用单轮基于记忆的交互在不同客户端之间共享知识，从而构建低开销的隐私保护系统。该方法为在大规模可访问数据上训练的公开NMT模型配备了$k$-最近邻（$k$NN）分类器，并整合了所有客户端私有文本数据构建的外部数据存储，形成最终的联邦学习模型。在此过程中引入了一种两阶段数据存储加密策略，以实现隐私保护。大量实验表明，与FedAvg相比，FedNN在显著降低计算和通信成本的同时，在不同联邦学习设置下仍保持了优秀的性能表现。

相关内容

Machine Translation

关注 210

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日