While Separate Source-Channel Coding (SSCC) retains the practical benefits of modular system design, its effectiveness in noisy text transmission is fundamentally constrained by the fragility of autoregressive source decoding. In low-SNR regimes, even a small number of residual bit errors after channel decoding may derail the subsequent lossless reconstruction process, especially when Arithmetic Coding (AC) relies on Large Language Model (LLM)-based probability estimation. Existing remedies either strengthen channel decoding based solely on channel observations or introduce contextual information only at the receiver for post-hoc correction, yet neither fully addresses the fragility of source probability modeling under residual channel errors. To this end, this paper proposes a Memory-Augmented Source Coding (MASC) scheme for robust SSCC-based transmission. Rather than treating context as external side information, MASC internalizes contextual patterns into a source model shared by both the transmitter-side source encoder and the receiver-side source decoder. Specifically, MASC employs a shared Parameterized Contextual Memory (PCM) to encode multi-order $n$-gram patterns, and further introduces a Mixture-of-Memory-Experts Router (MMER) to perform sparse, hidden-state-dependent routing over memory experts during autoregressive source modeling. By adaptively activating only the most relevant memories at each coding step, MASC refines source probability estimation, shortens average codelength, and mitigates the sensitivity of source decoding to residual channel errors. Extensive experiments over Rayleigh fading and AWGN channels demonstrate the effectiveness of the proposed scheme compared with state-of-the-art methods.
翻译:尽管分离信源信道编码(SSCC)保留了模块化系统设计的实际优势,但其在噪声文本传输中的有效性受到自回归信源解码脆弱性的根本限制。在低信噪比条件下,即使信道解码后仅有少量残余比特错误,也可能导致后续无损重构过程偏离轨道,特别是当算术编码(AC)依赖于基于大语言模型(LLM)的概率估计时。现有补救措施要么仅基于信道观测值加强信道解码,要么仅在接收端引入上下文信息进行事后修正,但两者均未能完全解决残余信道错误下信源概率建模的脆弱性问题。为此,本文提出了一种记忆增强型信源编码(MASC)方案,用于实现鲁棒的基于SSCC的传输。MASC并非将上下文视为外部辅助信息,而是将上下文模式内化到由发送端信源编码器和接收端信源解码器共享的信源模型中。具体地,MASC采用共享的参数化上下文记忆(PCM)来编码多阶n-gram模式,并进一步引入混合记忆专家路由器(MMER),在自回归信源建模过程中对记忆专家执行稀疏的、依赖于隐态的路由选择。通过在每个编码步骤自适应地激活最相关的记忆,MASC优化了信源概率估计、缩短了平均码长,并减轻了信源解码对残余信道错误的敏感性。在瑞利衰落和高斯白噪声信道上的大量实验表明,与最先进方法相比,所提方案具有有效性。