The variable-length Reverse Multi-Delimiter (RMD) codes are known to represent sequences of unbounded and unordered integers. When applied to data compression, they combine a good compression ratio with fast decoding. In this paper, we investigate another property of RMD codes: direct access to codewords in the encoded bitstream. We present a method that extracts and decodes a codeword from an RMD bitstream in almost constant time with a tiny space overhead, and we report experiments on its application to natural language text compression.