ScalO-RAN: Energy-aware Network Intelligence Scaling in Open RAN

Network virtualization, software-defined infrastructure, and orchestration are pivotal elements in contemporary networks, yielding new vectors for optimization and novel capabilities. In line with these principles, O-RAN presents an avenue to bypass vendor lock-in, circumvent vertical configurations, enable network programmability, and facilitate integrated Artificial Intelligence (AI) support. Moreover, modern container orchestration frameworks (e.g., Kubernetes, Red Hat OpenShift) simplify the way cellular base stations, as well as the newly introduced RAN Intelligent Controllers (RICs), are deployed, managed, and orchestrated. While this enables cost reduction via infrastructure sharing, it also makes it more challenging to meet O-RAN control latency requirements, especially during peak resource utilization. To address this problem, we propose ScalO-RAN, a control framework rooted in optimization and designed as an O-RAN rApp that allocates and scales AI-based O-RAN applications (xApps, rApps, dApps) to: (i) abide by application-specific latency requirements, and (ii) monetize the shared infrastructure while reducing energy consumption. We prototype ScalO-RAN on an OpenShift cluster with base stations, RIC, and a set of AI-based xApps deployed as micro-services. We evaluate ScalO-RAN both numerically and experimentally. Our results show that ScalO-RAN can optimally allocate and distribute O-RAN applications within available computing nodes to accommodate even stringent latency requirements. More importantly, we show that scaling O-RAN applications is primarily a time-constrained problem rather than a resource-constrained one, where scaling policies must account for stringent inference time of AI applications, and not only how many resources they consume.

翻译：网络虚拟化、软件定义基础设施及编排是当代网络的关键要素，为优化和新能力提供了新的维度。基于这些原则，O-RAN提供了一条途径，可规避供应商锁定、消除垂直配置、实现网络可编程性并促进集成式人工智能（AI）支持。此外，现代容器编排框架（如Kubernetes、Red Hat OpenShift）简化了蜂窝基站以及新引入的RAN智能控制器（RIC）的部署、管理与编排方式。虽然这通过基础设施共享实现了成本降低，但也使得满足O-RAN控制时延要求（特别是在资源利用率峰值期间）变得更加具有挑战性。为解决这一问题，我们提出ScalO-RAN——一种基于优化理论设计的控制框架，作为O-RAN rApp来分配和扩展基于AI的O-RAN应用（xApp、rApp、dApp），以实现：(i) 满足应用特定的时延要求，(ii) 在降低能耗的同时实现共享基础设施的货币化。我们在一个包含基站、RIC及一组以微服务形式部署的基于AI的xApp的OpenShift集群上对ScalO-RAN进行了原型实现。我们通过数值仿真和实验两种方式评估了ScalO-RAN。结果表明，ScalO-RAN能够在可用计算节点内优化分配和分布O-RAN应用，即使面对严苛的时延要求也能满足。更为重要的是，我们证明O-RAN应用的伸缩主要是一个时间约束问题而非资源约束问题——伸缩策略必须考虑AI应用严格的推理时间，而不仅仅是它们消耗多少资源。