Storing data in DNA is being explored as an efficient solution for archiving and in-object storage. Synthesis time and cost remain challenging, significantly limiting some applications at this stage. In this paper we investigate efficient synthesis, as it relates to cyclic synchronized synthesis technologies, such as photolithography. We define performance metrics related to the number of cycles needed for the synthesis of any fixed number of bits. We first expand on some results from the literature related to the channel capacity, addressing densities beyond those covered by prior work. This leads us to develop effective encoding achieving rate and capacity that are higher than previously reported. Finally, we analyze cost based on a parametric definition and determine some bounds and asymptotics. We investigate alphabet sizes that can be larger than 4, both for theoretical completeness and since practical approaches to such schemes were recently suggested and tested in the literature.
翻译:将数据存储在DNA中正被探索为归档和对象内存储的高效解决方案。合成时间和成本仍然具有挑战性,这在现阶段显著限制了某些应用。本文研究了与循环同步合成技术(如光刻技术)相关的高效合成问题。我们定义了与合成任意固定比特数所需周期数相关的性能指标。我们首先扩展了文献中关于信道容量的一些结果,处理了先前工作未涵盖的更高密度情况。这促使我们开发出实现比先前报道更高码率和容量的有效编码方案。最后,我们基于参数化定义分析成本,并确定了一些界限和渐近特性。我们研究了可能大于4的字母表大小,这既是出于理论完备性的考虑,也因为此类方案的实际方法最近已在文献中被提出和测试。