Diffusion-based generative models have achieved remarkable performance across various domains, yet their practical deployment is often limited by high sampling costs. While prior work focuses on training objectives or individual solvers, the holistic design of sampling, specifically solver selection and scheduling, remains dominated by static heuristics. In this work, we revisit this challenge through a geometric lens, proposing SDM, a principled framework that aligns the numerical solver with the intrinsic properties of the diffusion trajectory. By analyzing the ODE dynamics, we show that efficient low-order solvers suffice in early high-noise stages while higher-order solvers can be progressively deployed to handle the increasing non-linearity of later stages. Furthermore, we formalize the scheduling by introducing a Wasserstein-bounded optimization framework. This method systematically derives adaptive timesteps that explicitly bound the local discretization error, ensuring the sampling process remains faithful to the underlying continuous dynamics. Without requiring additional training or architectural modifications, SDM achieves state-of-the-art performance across standard benchmarks, including an FID of 1.93 on CIFAR-10, 2.41 on FFHQ, and 1.98 on AFHQv2, with a reduced number of function evaluations compared to existing samplers. Our code is available at https://github.com/aiimaginglab/sdm.
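The progressive solver-order schedule described above can be sketched as a simple rule that maps the current noise level to a solver order: cheap first-order steps in the early high-noise regime, higher orders as the trajectory's non-linearity grows. This is an illustrative sketch only; the thresholds, the noise range, and the `solver_order` function are hypothetical and do not reproduce the paper's actual Wasserstein-bounded schedule.

```python
import math

def solver_order(sigma, sigma_max=80.0, sigma_min=0.002):
    """Illustrative order schedule (hypothetical thresholds): use a cheap
    low-order step while noise is high and the ODE trajectory is nearly
    linear, then raise the order as sigma shrinks and curvature grows."""
    # Normalized log-noise position in [0, 1]: 0 = max noise, 1 = min noise.
    t = (math.log(sigma_max) - math.log(sigma)) / (
        math.log(sigma_max) - math.log(sigma_min))
    if t < 0.4:
        return 1  # Euler-style step suffices in the high-noise regime
    elif t < 0.8:
        return 2  # Heun-style correction handles moderate curvature
    else:
        return 3  # higher-order step near the data manifold
```

A sampler would query this rule at each timestep and dispatch to the corresponding numerical update, so the extra model evaluations of high-order solvers are spent only where the dynamics demand them.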