Intelligible Protocol Learning for Resource Allocation in 6G O-RAN Slicing

An adaptive standardized protocol is essential for addressing inter-slice resource contention and conflict in network slicing. Traditional protocol standardization is a cumbersome task that yields hardcoded predefined protocols, resulting in increased costs and delayed rollout. Going beyond these limitations, this paper proposes a novel multi-agent deep reinforcement learning (MADRL) communication framework called standalone explainable protocol (STEP) for future sixth-generation (6G) open radio access network (O-RAN) slicing. As new conditions arise and affect network operation, resource orchestration agents adapt their communication messages to promote the emergence of a protocol on-the-fly, which enables the mitigation of conflict and resource contention between network slices. STEP weaves together the notion of information bottleneck (IB) theory with deep Q-network (DQN) learning concepts. By incorporating a stochastic bottleneck layer -- inspired by variational autoencoders (VAEs) -- STEP imposes an information-theoretic constraint for emergent inter-agent communication. This ensures that agents exchange concise and meaningful information, preventing resource waste and enhancing the overall system performance. The learned protocols enhance interpretability, laying a robust foundation for standardizing next-generation 6G networks. By considering an O-RAN compliant network slicing resource allocation problem, a conflict resolution protocol is developed. In particular, the results demonstrate that, on average, STEP reduces inter-slice conflicts by up to 6.06x compared to a predefined protocol method. Furthermore, in comparison with an MADRL baseline, STEP achieves 1.4x and 3.5x lower resource underutilization and latency, respectively.

翻译：在网络切片中，一种自适应的标准化协议对于解决切片间的资源争用与冲突至关重要。传统的协议标准化是一项繁琐的任务，其产生的是硬编码的预定义协议，导致成本增加和部署延迟。为了突破这些限制，本文提出了一种新颖的多智能体深度强化学习（MADRL）通信框架，称为独立可解释协议（STEP），用于未来的第六代（6G）开放无线接入网（O-RAN）切片。当新的条件出现并影响网络运行时，资源编排智能体会自适应地调整其通信消息，以促进协议在运行中即时涌现，从而能够缓解网络切片之间的冲突和资源争用。STEP将信息瓶颈（IB）理论与深度Q网络（DQN）学习概念交织在一起。通过引入一个受变分自编码器（VAE）启发的随机瓶颈层，STEP对智能体间涌现的通信施加了信息论约束。这确保了智能体交换简洁且有意义的信息，防止资源浪费并提升整体系统性能。学习到的协议增强了可解释性，为下一代6G网络的标准化奠定了坚实基础。通过考虑一个符合O-RAN规范的网络切片资源分配问题，本文开发了一种冲突解决协议。具体而言，结果表明，与预定义协议方法相比，STEP平均能将切片间冲突减少高达6.06倍。此外，与一个MADRL基线方法相比，STEP分别实现了1.4倍和3.5倍更低的资源利用不足和延迟。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日