Due to the surging amount of AI-generated images, its provisioning to edges and mobile users from the cloud incurs substantial traffic on networks. Generative semantic communication (GSC) offers a promising solution by transmitting highly compact information, i.e., prompt text and latent representations, instead of high-dimensional image data. However, GSC relies on the alignment between the knowledge in the cloud generative AI (GAI) and that possessed by the edges and users, and between the knowledge for wireless transmission and that of actual channels, which remains challenging. In this paper, we propose DeKA-g, a distillation-enabled knowledge alignment algorithm for GSC systems. The core idea is to distill the image generation knowledge from the cloud-GAI into low-rank matrices, which can be incorporated by the edge and used to adapt the transmission knowledge to diverse wireless channel conditions. DeKA-g comprises two novel methods: metaword-aided knowledge distillation (MAKD) and condition-aware low-rank adaptation (CALA). For MAKD, an optimized metaword is employed to enhance the efficiency of knowledge distillation, while CALA enables efficient adaptation to diverse rate requirements and channel conditions. From simulation results, DeKA-g improves the consistency between the edge-generated images and the cloud-generated ones by 44% and enahnces the average transmission quality in terms of PSNR by 6.5 dB over the baselines without knowledge alignment.
翻译:由于AI生成图像数量激增,将其从云端提供给边缘和移动用户会给网络带来巨大流量负担。生成式语义通信通过传输高度紧凑的信息(即提示文本和潜在表示)而非高维图像数据,为此提供了有前景的解决方案。然而,GSC的实现依赖于云端生成式人工智能所具备的知识与边缘及用户端知识之间的对齐,以及无线传输知识与实际信道特性之间的匹配,这仍然是当前面临的挑战。本文提出DeKA-g算法——一种基于蒸馏的知识对齐方法,专为GSC系统设计。其核心思想是将云端GAI中的图像生成知识蒸馏为低秩矩阵,这些矩阵可被边缘侧集成,并用于使传输知识适应多样化的无线信道条件。DeKA-g包含两项创新方法:元词辅助知识蒸馏与条件感知低秩适配。在MAKD中,通过优化设计的元词提升知识蒸馏效率;而CALA则能根据不同的速率需求和信道条件实现高效适配。仿真结果表明,与未进行知识对齐的基线方法相比,DeKA-g将边缘生成图像与云端生成图像的一致性提升了44%,并在PSNR指标上将平均传输质量提高了6.5 dB。