Generative artificial intelligence is now a widely used tool in molecular science. Despite the popularity of probabilistic generative models, numerical experiments benchmarking their performance on molecular data are lacking. In this work, we introduce and explain several classes of generative models, broadly sorted into two categories: flow-based models and diffusion models. We select three representative models: Neural Spline Flows, Conditional Flow Matching, and Denoising Diffusion Probabilistic Models, and examine their accuracy, computational cost, and generation speed across datasets with tunable dimensionality, complexity, and modal asymmetry. Our findings are varied, with no one framework being the best for all purposes. In a nutshell, (i) Neural Spline Flows do best at capturing mode asymmetry present in low-dimensional data, (ii) Conditional Flow Matching outperforms other models for high-dimensional data with low complexity, and (iii) Denoising Diffusion Probabilistic Models appears the best for low-dimensional data with high complexity. Our datasets include a Gaussian mixture model and the dihedral torsion angle distribution of the Aib\textsubscript{9} peptide, generated via a molecular dynamics simulation. We hope our taxonomy of probabilistic generative frameworks and numerical results may guide model selection for a wide range of molecular tasks.
翻译:生成式人工智能现已成为分子科学中广泛使用的工具。尽管概率生成模型广受欢迎,但针对其在分子数据上性能的基准数值实验仍较为缺乏。本文介绍并解释了几类生成模型,大致分为两类:基于流的模型和扩散模型。我们选取了三种代表性模型:神经样条流、条件流匹配和去噪扩散概率模型,并在具有可调维度、复杂性和模态不对称性的数据集上考察了它们的准确性、计算成本和生成速度。我们的研究结果多样,没有一种框架在所有用途上都是最优的。简而言之:(i) 神经样条流在捕捉低维数据中存在的模态不对称性方面表现最佳;(ii) 对于低复杂度的高维数据,条件流匹配优于其他模型;(iii) 对于高复杂度的低维数据,去噪扩散概率模型似乎是最佳选择。我们的数据集包括一个高斯混合模型以及通过分子动力学模拟生成的Aib\textsubscript{9}肽的二面角扭转角分布。我们希望我们对概率生成框架的分类和数值结果能够为广泛的分子任务中的模型选择提供指导。