Automated Security Response through Online Learning with Adaptive Conjectures

from arxiv, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

We study automated security response for an IT infrastructure and formulate the interaction between an attacker and a defender as a partially observed, non-stationary game. We relax the standard assumption that the game model is correctly specified and consider that each player has a probabilistic conjecture about the model, which may be misspecified in the sense that the true model has probability 0. This formulation allows us to capture uncertainty about the infrastructure and the intents of the players. To learn effective game strategies online, we design a novel method where a player iteratively adapts its conjecture using Bayesian learning and updates its strategy through rollout. We prove that the conjectures converge to best fits, and we provide a bound on the performance improvement that rollout enables with a conjectured model. To characterize the steady state of the game, we propose a variant of the Berk-Nash equilibrium. We present our method through an advanced persistent threat use case. Simulation studies based on testbed measurements show that our method produces effective security strategies that adapt to a changing environment. We also find that our method enables faster convergence than current reinforcement learning techniques.

翻译：我们研究面向IT基础设施的自动化安全响应，并将攻击者与防御者之间的交互建模为部分可观测的非平稳博弈。我们放宽了博弈模型需正确设定的标准假设，考虑每个参与者对模型持有概率性猜想，该猜想可能被错误指定（即真实模型概率为零）。这一建模方式使我们能够刻画基础设施的不确定性以及参与者的意图。为了在线学习有效的博弈策略，我们设计了一种新颖方法：参与者通过贝叶斯学习迭代调整其猜想，并通过滚动时域控制更新策略。我们证明猜想将收敛至最优拟合，并给出了基于猜想模型进行滚动控制所能实现的性能改进上界。为表征博弈的稳态，我们提出了Berk-Nash均衡的变体。通过高级持续性威胁用例展示了该方法。基于测试平台测量的仿真研究表明，我们的方法能产生适应动态环境的有效安全策略，且与传统强化学习方法相比具有更快的收敛速度。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日