Landseer: Exploring the Machine Learning Defense Landscape

Machine learning systems face diverse threats that undermine robustness, privacy, and fairness. Although many defenses have been proposed, each typically addresses a single risk in isolation. Real-world deployments, however, require these defenses to be composed to meet multiple guarantees simultaneously. The process of composing defenses is complex and not well understood, and its impact on performance and security remains unclear. We present Landseer, a modular framework for integrating machine learning (ML) defenses into the ML lifecycle and systematically evaluating their composition. Landseer encapsulates defenses as containerized modules, allowing existing and new techniques to be plugged in with minimal effort. Its evaluation engine automates experiments across multiple metrics, supporting the study of defenses both individually and in combination. In a preliminary study, we identified 35 state-of-the-art machine learning defenses. After filtering for reproducibility, we analyzed their performance using Landseer's unified evaluation process. Our findings reveal gaps in replicability across defense families and provide insights into the challenges and opportunities in integrating multiple defenses, establishing a foundation for improving the reliability of machine learning systems.
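The idea of evaluating defenses both individually and in combination can be sketched as follows. This is a minimal illustration only, not Landseer's actual interface: the `Defense` class, the stage labels, the three example defense names, and the `toy_evaluate` placeholder are all assumptions introduced here. In a real setup each module would presumably wrap a container, and the evaluation function would train and measure a model.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Callable, Dict, Iterator, List, Tuple


# Hypothetical stand-in for a containerized defense module.
# The field names and stage labels are illustrative assumptions.
@dataclass(frozen=True)
class Defense:
    name: str
    stage: str  # assumed lifecycle stage, e.g. "pre-training" or "training"


def enumerate_compositions(
    defenses: List[Defense], max_size: int
) -> Iterator[Tuple[Defense, ...]]:
    """Yield every non-empty combination of at most max_size defenses."""
    for k in range(1, max_size + 1):
        yield from combinations(defenses, k)


def evaluate_all(
    defenses: List[Defense],
    evaluate: Callable[[Tuple[Defense, ...]], Dict[str, float]],
    max_size: int = 2,
) -> Dict[Tuple[str, ...], Dict[str, float]]:
    """Run one evaluation callback over single defenses and their combinations."""
    results = {}
    for combo in enumerate_compositions(defenses, max_size):
        key = tuple(d.name for d in combo)
        results[key] = evaluate(combo)
    return results


def toy_evaluate(combo: Tuple[Defense, ...]) -> Dict[str, float]:
    # Placeholder metric only: a real evaluator would report accuracy,
    # robustness, privacy, and fairness measurements for the composed pipeline.
    return {"num_defenses": float(len(combo))}


# Illustrative defenses; not claimed to be among the 35 surveyed.
DEFENSES = [
    Defense("adversarial_training", "training"),
    Defense("dp_sgd", "training"),
    Defense("reweighing", "pre-training"),
]

if __name__ == "__main__":
    for names, metrics in evaluate_all(DEFENSES, toy_evaluate).items():
        print(names, metrics)
```

Keeping the evaluation callback separate from the enumeration means the same loop covers the individual and the combined case, which is the comparison the abstract describes. Note that the number of combinations grows quickly with `max_size`, so a cap of this kind is a practical assumption rather than something the source specifies.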

