The emergence of the semantic-aware paradigm presents opportunities for innovative services, especially in the context of 6G-based applications. Although significant progress has been made in semantic extraction techniques, the incorporation of semantic information into resource allocation decision-making is still in its early stages, lacking consideration of the requirements and characteristics of future systems. In response, this paper introduces a novel formulation for the problem of multiple access to the wireless spectrum. It aims to optimize the utilization-fairness trade-off, using the $\alpha$-fairness metric, while accounting for user data correlation by introducing the concepts of self- and assisted throughputs. Initially, the problem is analyzed to identify its optimal solution. Subsequently, a Semantic-Aware Multi-Agent Double and Dueling Deep Q-Learning (SAMA-D3QL) technique is proposed. This method is grounded in Model-free Multi-Agent Deep Reinforcement Learning (MADRL), enabling the user equipment to autonomously make decisions regarding wireless spectrum access based solely on their local individual observations. The efficiency of the proposed technique is evaluated through two scenarios: single-channel and multi-channel. The findings illustrate that, across a spectrum of $\alpha$ values, association matrices, and channels, SAMA-D3QL consistently outperforms alternative approaches. This establishes it as a promising candidate for facilitating the realization of future federated, dynamically evolving applications.
翻译:语义感知范式的出现为创新服务带来了机遇,尤其在6G应用背景下。尽管语义提取技术已取得显著进展,但将语义信息纳入资源分配决策仍处于初级阶段,缺乏对未来系统需求与特性的考量。为此,本文提出一种面向无线频谱多址接入问题的新型建模方法。该方法旨在利用$\alpha$公平性度量优化利用率与公平性之间的权衡,同时通过引入自吞吐量与辅助吞吐量概念考虑用户数据相关性。首先对问题进行最优解分析,随后提出语义感知多智能体双重对偶深度Q学习(SAMA-D3QL)技术。该方法基于无模型多智能体深度强化学习(MADRL),使终端设备能够仅凭局部观测自主决策无线频谱接入。通过单信道与多信道两种场景评估所提技术性能。结果表明,在不同$\alpha$值、关联矩阵和信道条件下,SAMA-D3QL均优于对比方案,成为支撑未来联邦式动态演进应用的有力候选方案。