DNA labeling is a powerful tool in molecular biology and biotechnology that allows for the visualization, detection, and study of DNA at the molecular level. Under this paradigm, a DNA molecule is labeled by k specific patterns and is then imaged. The resulting image is modeled as a (k + 1)-ary sequence in which any nonzero symbol indicates the appearance of the corresponding label in the DNA molecule. The primary goal of this work is to study the labeling capacity, defined as the maximal information rate that can be obtained using this labeling process. The labeling capacity is computed for any single label, and several results are provided for multiple labels as well. Moreover, we provide the minimal number of labels of length one or two needed to attain a labeling capacity of 2.
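The labeling map described above can be illustrated with a minimal sketch. It assumes one plausible reading of the model, which the abstract does not spell out: the symbol at position i is j when label j begins at position i, and 0 otherwise. The function name `label_sequence`, the tie-breaking rule (the first matching label wins), and the example strings are hypothetical and chosen only for illustration.

```python
def label_sequence(dna, labels):
    """Map a DNA string to a (k+1)-ary sequence for k labels.

    Assumption: position i gets symbol j (1-based) if labels[j-1]
    starts at position i, and 0 if no label starts there. If several
    labels match at the same position, the first one in the list wins.
    """
    out = [0] * len(dna)
    for i in range(len(dna)):
        for j, pattern in enumerate(labels, start=1):
            if dna.startswith(pattern, i):
                out[i] = j
                break
    return out


# One label gives a binary sequence; two labels give a ternary one.
single = label_sequence("ACGTACG", ["CG"])
double = label_sequence("ACGTACG", ["A", "CG"])
print(single)
print(double)
```

Under this reading, different DNA strings can map to the same output sequence, which is why the attainable information rate depends on the choice of labels. A capacity of 2 corresponds to log2(4) = 2 bits per nucleotide, the rate at which a sequence over the four-letter DNA alphabet is fully determined.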