Deep generative chemistry models emerge as powerful tools to expedite drug discovery. However, the immense size and complexity of the structural space of all possible drug-like molecules pose significant obstacles, which could be overcome with hybrid architectures combining quantum computers with deep classical networks. As the first step toward this goal, we built a compact discrete variational autoencoder (DVAE) with a Restricted Boltzmann Machine (RBM) of reduced size in its latent layer. The size of the proposed model was small enough to fit on a state-of-the-art D-Wave quantum annealer and allowed training on a subset of the ChEMBL dataset of biologically active compounds. Finally, we generated 2331 novel chemical structures with medicinal chemistry and synthetic accessibility properties in the ranges typical for molecules from ChEMBL. The presented results demonstrate the feasibility of using already existing or soon-to-be-available quantum computing devices as testbeds for future drug discovery applications.
翻译:深度生成化学模型已成为加速药物发现的强大工具。然而,所有可能的类药分子结构空间极为庞大且复杂,这构成了重大障碍,而结合量子计算机与深度经典网络的混合架构有望克服这些难题。作为迈向这一目标的第一步,我们构建了一个紧凑型离散变分自动编码器(DVAE),其潜在层采用尺寸缩减的受限玻尔兹曼机(RBM)。所提出模型的规模足够小,可适配于最先进的D-Wave量子退火器,并允许在ChEMBL生物活性化合物数据集的子集上进行训练。最终,我们生成了2331种新型化学结构,其药物化学和合成可及性属性均处于ChEMBL分子典型范围内。本研究结果证明了利用现有或即将可用的量子计算设备作为未来药物发现应用试验台的可行性。