Secure computation is of critical importance to not only the DoD, but across financial institutions, healthcare, and anywhere personally identifiable information (PII) is accessed. Traditional security techniques require data to be decrypted before performing any computation. When processed on untrusted systems the decrypted data is vulnerable to attacks to extract the sensitive information. To address these vulnerabilities Fully Homomorphic Encryption (FHE) keeps the data encrypted during computation and secures the results, even in these untrusted environments. However, FHE requires a significant amount of computation to perform equivalent unencrypted operations. To be useful, FHE must significantly close the computation gap (within 10x) to make encrypted processing practical. To accomplish this ambitious goal the TREBUCHET project is leading research and development in FHE processing hardware to accelerate deep computations on encrypted data, as part of the DARPA MTO Data Privacy for Virtual Environments (DPRIVE) program. We accelerate the major secure standardized FHE schemes (BGV, BFV, CKKS, FHEW, etc.) at >=128-bit security while integrating with the open-source PALISADE and OpenFHE libraries currently used in the DoD and in industry. We utilize a novel tile-based chip design with highly parallel ALUs optimized for vectorized 128b modulo arithmetic. The TREBUCHET coprocessor design provides a highly modular, flexible, and extensible FHE accelerator for easy reconfiguration, deployment, integration and application on other hardware form factors, such as System-on-Chip or alternate chip areas.
翻译:摘要:安全计算不仅对国防部至关重要,在金融机构、医疗保健以及任何涉及个人身份信息(PII)访问的领域也同样关键。传统安全技术要求在执行任何计算前先解密数据。当在不可信系统上处理时,解密后的数据容易遭受攻击以提取敏感信息。为解决这些漏洞,全同态加密(FHE)可在计算过程中保持数据加密状态,即使在不可信环境下也能确保结果安全。然而,FHE需要显著的计算量才能执行等效的未加密操作。为使其具备实用性,必须大幅缩小计算差距(在10倍以内),使加密处理变得可行。为实现这一宏伟目标,TREBUCHET项目正主导FHE处理硬件的研发,以加速加密数据上的深度计算,作为DARPA MTO虚拟环境数据隐私(DPRIVE)计划的一部分。我们基于≥128位安全强度,加速了主要的标准化FHE安全方案(BGV、BFV、CKKS、FHEW等),同时集成了国防部和工业界当前使用的开源PALISADE与OpenFHE库。我们采用创新的基于瓦片的芯片设计,配备高度并行的ALU,针对向量化128位模运算进行了优化。TREBUCHET协处理器设计提供了一种高度模块化、灵活且可扩展的FHE加速器,便于在其他硬件形态(如片上系统或替代芯片区域)上进行轻松重构、部署、集成和应用。