This paper examines the maximum code rate achievable by a data-driven communication system over an unknown discrete memoryless channel in the finite blocklength regime. A class of channel codes, called learning-based channel codes, is first introduced. A learning-based channel code includes a learning algorithm that transforms the training data into a pair of encoding and decoding functions satisfying a statistical reliability constraint. Data-dependent achievability and converse bounds in the non-asymptotic regime are established for this class of channel codes. It is shown analytically that the asymptotic expansions of the bounds on the maximum achievable code rate of learning-based channel codes are tight for sufficiently large training data.