Machine learning (ML) is increasingly being deployed in critical systems. The data dependence of ML makes securing data used to train and test ML-enabled systems of utmost importance. While the field of cybersecurity has well-established practices for securing information, ML-enabled systems create new attack vectors. Furthermore, data science and cybersecurity domains adhere to their own set of skills and terminologies. This survey aims to present background information for experts in both domains in topics such as cryptography, access control, zero trust architectures, homomorphic encryption, differential privacy for machine learning, and federated learning to establish shared foundations and promote advancements in data security.
翻译:机器学习(ML)正日益部署于关键系统中。ML对数据的依赖性使得保障用于训练和测试ML赋能系统的数据安全至关重要。尽管网络安全领域已建立成熟的信息安全实践,但ML赋能系统却催生了新的攻击向量。此外,数据科学与网络安全领域遵循各自独立的技能体系与术语规范。本综述旨在为两个领域的专家提供密码学、访问控制、零信任架构、同态加密、针对机器学习的差分隐私以及联邦学习等主题的背景知识,以建立共同基础并推动数据安全领域的进步。