Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition

Generative diffusion models (GDMs) have recently shown great success in synthesizing multimedia signals with high perceptual quality enabling highly efficient semantic communications in future wireless networks. In this paper, we develop an intent-aware generative semantic multicasting framework utilizing pre-trained diffusion models. In the proposed framework, the transmitter decomposes the source signal to multiple semantic classes based on the multi-user intent, i.e. each user is assumed to be interested in details of only a subset of the semantic classes. The transmitter then sends to each user only its intended classes, and multicasts a highly compressed semantic map to all users over shared wireless resources that allows them to locally synthesize the other classes, i.e. non-intended classes, utilizing pre-trained diffusion models. The signal retrieved at each user is thereby partially reconstructed and partially synthesized utilizing the received semantic map. This improves utilization of the wireless resources, with better preserving privacy of the non-intended classes. We design a communication/computation-aware scheme for per-class adaptation of the communication parameters, such as the transmission power and compression rate to minimize the total latency of retrieving signals at multiple receivers, tailored to the prevailing channel conditions as well as the users reconstruction/synthesis distortion/perception requirements. The simulation results demonstrate significantly reduced per-user latency compared with non-generative and intent-unaware multicasting benchmarks while maintaining high perceptual quality of the signals retrieved at the users.

翻译：生成扩散模型（GDMs）近期在合成具有高感知质量的多媒体信号方面展现出巨大成功，为未来无线网络中实现高效语义通信提供了可能。本文提出一种利用预训练扩散模型的意图感知生成式语义多播框架。在该框架中，发射机根据多用户意图将源信号分解为多个语义类别——即假设每个用户仅对部分语义类别的细节感兴趣。发射机随后仅向每个用户发送其目标类别，并通过共享无线资源向所有用户多播高度压缩的语义地图，使其能够利用预训练扩散模型在本地合成其他类别（即非目标类别）。每个用户最终获取的信号部分通过接收的语义地图重建，部分通过合成生成。该机制不仅提高了无线资源利用率，还能更好地保护非目标类别的隐私。我们设计了一种通信/计算感知方案，用于根据每类语义特征自适应调整通信参数（如发射功率和压缩率），以最小化多接收端信号获取的总延迟。该方案充分适配当前信道条件以及用户的重建/合成失真/感知质量要求。仿真结果表明，与非生成式及无意图感知的多播基准方法相比，所提方案在保持用户端信号高感知质量的同时，显著降低了单用户延迟。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日