The rapid adoption of large language models (LLMs) presents new challenges for existing network architectures due to significant peak traffic and high communication uncertainty. Traditional wireless networks struggle to support efficiently, leading to intolerable response delays, disconnections, and resource wastage. To address these issues, we propose LLM-Slice, the first system to provide dedicated communication slices for LLMs within a wireless network environment. By creating LLM-specific network slices, LLM-Slice efficiently binds services with communication resources. Based on user equipment (UE) requests and a permissions database, the system registers specific slices to offer controllable LLM services, integrating a downlink resource control module to optimize response speed, enhance resource utilization, and reduce disconnections. By deploying and validating in a real UE-gNB-CN environment, numerical results demonstrate that LLM-Slice significantly improves response speed and resource efficiency, providing a novel solution for fast and controllable LLM access in wireless networks.
翻译:大型语言模型(LLM)的快速普及因其显著的峰值流量和高度不确定的通信需求,对现有网络架构提出了新的挑战。传统无线网络难以有效支撑,导致不可容忍的响应延迟、连接中断及资源浪费。为解决这些问题,本文提出LLM-Slice,这是首个在无线网络环境中为LLM提供专用通信切片的系统。通过创建面向LLM的专用网络切片,LLM-Slice实现了服务与通信资源的高效绑定。该系统基于用户设备(UE)请求及权限数据库注册特定切片以提供可控的LLM服务,并集成下行资源控制模块以优化响应速度、提升资源利用率并减少连接中断。通过在真实UE-gNB-CN环境中部署与验证,数值结果表明LLM-Slice能显著提升响应速度与资源效率,为无线网络中快速可控的LLM接入提供了创新解决方案。