Generative diffusion models (GDMs) have recently shown great success in synthesizing multimedia signals with high perceptual quality enabling highly efficient semantic communications in future wireless networks. In this paper, we develop an intent-aware generative semantic multicasting framework utilizing pre-trained diffusion models. In the proposed framework, the transmitter decomposes the source signal to multiple semantic classes based on the multi-user intent, i.e. each user is assumed to be interested in details of only a subset of the semantic classes. The transmitter then sends to each user only its intended classes, and multicasts a highly compressed semantic map to all users over shared wireless resources that allows them to locally synthesize the other classes, i.e. non-intended classes, utilizing pre-trained diffusion models. The signal retrieved at each user is thereby partially reconstructed and partially synthesized utilizing the received semantic map. This improves utilization of the wireless resources, with better preserving privacy of the non-intended classes. We design a communication/computation-aware scheme for per-class adaptation of the communication parameters, such as the transmission power and compression rate to minimize the total latency of retrieving signals at multiple receivers, tailored to the prevailing channel conditions as well as the users reconstruction/synthesis distortion/perception requirements. The simulation results demonstrate significantly reduced per-user latency compared with non-generative and intent-unaware multicasting benchmarks while maintaining high perceptual quality of the signals retrieved at the users.
翻译:生成扩散模型(GDMs)近期在合成具有高感知质量的多媒体信号方面展现出巨大成功,为未来无线网络中实现高效语义通信提供了可能。本文提出一种利用预训练扩散模型的意图感知生成式语义多播框架。在该框架中,发射机根据多用户意图将源信号分解为多个语义类别——即假设每个用户仅对部分语义类别的细节感兴趣。发射机随后仅向每个用户发送其目标类别,并通过共享无线资源向所有用户多播高度压缩的语义地图,使其能够利用预训练扩散模型在本地合成其他类别(即非目标类别)。每个用户最终获取的信号部分通过接收的语义地图重建,部分通过合成生成。该机制不仅提高了无线资源利用率,还能更好地保护非目标类别的隐私。我们设计了一种通信/计算感知方案,用于根据每类语义特征自适应调整通信参数(如发射功率和压缩率),以最小化多接收端信号获取的总延迟。该方案充分适配当前信道条件以及用户的重建/合成失真/感知质量要求。仿真结果表明,与非生成式及无意图感知的多播基准方法相比,所提方案在保持用户端信号高感知质量的同时,显著降低了单用户延迟。