Accuracy and efficiency remain challenges for multi-party computation (MPC) frameworks. Spin is a GPU-accelerated MPC framework that supports multiple computation parties and a dishonest-majority adversarial setting. We propose optimized protocols for the non-linear functions that are critical for machine learning, as well as several novel optimizations specific to attention, the fundamental unit of Transformer models. Together, these allow Spin to perform non-trivial CNN training and Transformer inference without sacrificing security. At the backend level, Spin leverages GPUs, CPUs, and RDMA-enabled smart network cards for acceleration. Comprehensive evaluations demonstrate that Spin can be up to $2\times$ faster than the state of the art for deep neural network training. For inference on a Transformer model with 18.9 million parameters, our attention-specific optimizations enable Spin to achieve better efficiency, less communication, and better accuracy.