Diffusion models have achieved state-of-the-art performance in generating many different kinds of data, including images, text, and videos. Despite their success, there has been limited research on how the underlying diffusion process and the final convergent prior can affect generative performance; this research has also been limited to continuous data types and a score-based diffusion framework. To fill this gap, we explore how different discrete diffusion kernels (which converge to different prior distributions) affect the performance of diffusion models for graphs. To this end, we developed a novel formulation of a family of discrete diffusion kernels which are easily adjustable to converge to different Bernoulli priors, and we study the effect of these different kernels on generative performance. We show that the quality of generated graphs is sensitive to the prior used, and that the optimal choice cannot be explained by obvious statistics or metrics, which challenges the intuitions which previous works have suggested.
翻译:扩散模型在生成图像、文本和视频等多种数据类型方面已取得最先进性能。尽管取得了成功,但关于底层扩散过程及最终收敛先验如何影响生成性能的研究仍十分有限;且现有研究仅限于连续数据类型和基于分数的扩散框架。为填补这一空白,我们探究了不同离散扩散核(收敛至不同先验分布)对图扩散模型性能的影响。为此,我们提出了一种新型离散扩散核族的形式化表述,该核族可灵活调整以收敛至不同伯努利先验,并系统研究了这些不同核函数对生成性能的影响。研究表明,生成图的质量对所选先验高度敏感,且最优选择无法通过直观统计量或指标进行解释,这挑战了前人研究中提出的直觉假设。