This is the Replicated Computational Results (RCR) Report for the paper ``Can LLMs Hack Enterprise Networks?" The paper empirically investigates the efficacy and effectiveness of different LLMs for penetration-testing enterprise networks, i.e., Microsoft Active Directory Assumed-Breach Simulations. This RCR report describes the artifacts used in the paper, how to create an evaluation setup, and highlights the analysis scripts provided within our prototype.
翻译:本报告为论文《大型语言模型能否入侵企业网络?》的可复现计算结果报告。该论文通过实证研究,评估了不同LLM在企业网络渗透测试(即微软Active Directory假定违规模拟)中的效能与有效性。本RCR报告详细说明了论文中使用的实验构件、评估环境的搭建方法,并重点介绍了我们原型系统中提供的分析脚本。