Cellular networks are critical infrastructure supporting billions of worldwide users and safety- and mission-critical services. Vulnerabilities in cellular networks can therefore cause service disruption, privacy breaches, and broad societal harm, motivating growing efforts to analyze 3GPP specifications that define required device and operator behavior. While large language models (LLMs) have demonstrated the capability for reading technical documents, cellular specifications impose unique challenges: faithful interpretation of normative language, reasoning across cross-referenced clauses, and verifiable conclusions grounded in multimodal evidence such as tables and figures. To address these challenges, we propose CellSpecSec-ARI, a unified Adapt-Retrieve-Integrate framework for systematic understanding and standard-driven security analysis of 3GPP specifications; CellularSpecSec-Bench, a staged benchmark, containing newly constructed high-quality datasets with expert-verified and corrected subsets from prior open-source resources. Together, they establish an accessible and reproducible foundation for quantifying progress in specification understanding and security reasoning in the cellular network security domain.
翻译:蜂窝网络是支撑全球数十亿用户及安全关键型与任务关键型服务的关键基础设施。因此,蜂窝网络中的漏洞可能导致服务中断、隐私泄露及广泛的社会危害,这推动了对定义设备与运营商行为要求的3GPP规范进行分析的日益增长的研究努力。尽管大语言模型(LLMs)已展现出阅读技术文档的能力,但蜂窝网络规范提出了独特的挑战:对规范性语言的忠实解读、跨交叉引用条款的推理,以及基于表格与图形等多模态证据的可验证结论。为应对这些挑战,我们提出CellSpecSec-ARI,一个统一的适应-检索-整合框架,用于对3GPP规范进行系统性理解与标准驱动的安全分析;以及CellularSpecSec-Bench,一个分级基准,包含新构建的高质量数据集,其中包含来自先前开源资源的经专家验证与修正的子集。二者共同为量化蜂窝网络安全领域中规范理解与安全推理的进展,建立了一个可访问且可复现的基础。