We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tailed Brownian motion (BM) with independent increments. In this paper, we replace BM with an approximation of its non-Markovian counterpart, fractional Brownian motion (fBM), characterized by correlated increments and Hurst index $H \in (0,1)$, where $H=0.5$ recovers the classical BM. To ensure tractable inference and learning, we employ a recently popularized Markov approximation of fBM (MA-fBM) and derive its reverse-time model, resulting in generative fractional diffusion models (GFDM). We characterize the forward dynamics using a continuous reparameterization trick and propose augmented score matching to efficiently learn the score function, which is partly known in closed form, at minimal added cost. The ability to drive our diffusion model via MA-fBM offers flexibility and control. $H \leq 0.5$ enters the regime of rough paths whereas $H>0.5$ regularizes diffusion paths and invokes long-term memory. The Markov approximation allows added control by varying the number of Markov processes linearly combined to approximate fBM. Our evaluations on real image datasets demonstrate that GFDM achieves greater pixel-wise diversity and enhanced image quality, as indicated by a lower FID, offering a promising alternative to traditional diffusion models
翻译:我们提出了首个利用分数扩散过程作为底层动力学的连续时间基于分数的生成模型。尽管扩散模型在捕捉数据分布方面表现出色,但它们仍存在各种局限性,如收敛速度慢、在非平衡数据上出现模式崩溃以及缺乏多样性。这些问题部分与使用具有独立增量的轻尾布朗运动(BM)有关。在本文中,我们将BM替换为其非马尔可夫对应物——分数布朗运动(fBM)的一种近似,其特征是具有相关增量以及赫斯特指数$H \in (0,1)$,其中$H=0.5$对应经典的BM。为确保可处理的推断和学习,我们采用了一种近期流行的fBM马尔可夫近似(MA-fBM),并推导出其逆时间模型,从而得到生成式分数扩散模型(GFDM)。我们使用连续重参数化技巧来刻画前向动力学,并提出增强分数匹配方法,以最小额外成本高效地学习分数函数(其部分闭式解已知)。通过MA-fBM驱动我们的扩散模型,提供了灵活性和控制能力。$H \leq 0.5$进入粗糙路径机制,而$H>0.5$则正则化扩散路径并引入长期记忆。马尔可夫近似通过改变线性组合以近似fBM的马尔可夫过程数量,提供了额外的控制手段。我们在真实图像数据集上的评估表明,GFDM实现了更高的像素级多样性和更好的图像质量(由更低的FID分数指示),为传统扩散模型提供了一种有前景的替代方案。