We propose a novel coding scheme for DNA-based storage systems, called shift-interleave (SI) coding, designed to correct insertion, deletion, and substitution (IDS) errors as well as sequence losses. The SI coding scheme uses multiple codewords from two binary low-density parity-check (LDPC) codes. These codewords are converted into DNA base sequences through shifting, bit-to-base mapping, and interleaving. At the receiver, an efficient non-iterative detection and decoding scheme estimates the codewords sequentially. Numerical results show that the SI coding scheme performs well in correcting both IDS errors and sequence losses.
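The three encoder steps named in the abstract (shifting, bit-to-base mapping, interleaving) can be illustrated with a minimal sketch. The abstract does not give the details of any step, so everything specific here is an assumption made for illustration: the shift is taken to be a cyclic shift of the k-th codeword by k positions, each base is assumed to carry one bit from each of the two codes (00→A, 01→C, 10→G, 11→T), and interleaving is assumed to be a simple round-robin over the mapped sequences. The function names and the short bit lists are hypothetical placeholders, not real LDPC codewords and not the authors' construction.

```python
from typing import List

# Assumed bit-pair-to-base mapping: 00->A, 01->C, 10->G, 11->T.
BASES = "ACGT"


def cyclic_shift(bits: List[int], s: int) -> List[int]:
    """Cyclically shift a bit list left by s positions (assumed shift rule)."""
    s %= len(bits)
    return bits[s:] + bits[:s]


def bits_to_bases(upper: List[int], lower: List[int]) -> str:
    """Map bit i of `upper` and bit i of `lower` to one DNA base."""
    return "".join(BASES[2 * u + l] for u, l in zip(upper, lower))


def interleave(seqs: List[str]) -> str:
    """Round-robin interleave equal-length sequences into one sequence."""
    return "".join("".join(column) for column in zip(*seqs))


def deinterleave(seq: str, count: int) -> List[str]:
    """Inverse of `interleave` for `count` equal-length sequences."""
    return [seq[k::count] for k in range(count)]


def si_encode_sketch(upper_cws: List[List[int]], lower_cws: List[List[int]]) -> str:
    """Shift, map, and interleave placeholder codewords from two codes.

    Assumption: the k-th codeword of the second code is shifted by k
    positions before being paired with the k-th codeword of the first code.
    """
    mapped = [
        bits_to_bases(upper, cyclic_shift(lower, k))
        for k, (upper, lower) in enumerate(zip(upper_cws, lower_cws))
    ]
    return interleave(mapped)


# Placeholder bit lists standing in for codewords of the two binary codes.
upper_cws = [[1, 0, 1, 1], [0, 1, 0, 0]]
lower_cws = [[0, 0, 1, 0], [1, 1, 0, 1]]
strand = si_encode_sketch(upper_cws, lower_cws)
print(strand)  # GCAGTCGC
```

In this toy run the two mapped sequences are `GATG` and `CGCC`, and `deinterleave(strand, 2)` recovers them. The sketch covers only the encoder side; the non-iterative detection and decoding at the receiver is not modeled.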