Classifier-guided diffusion models generate conditional samples by augmenting the reverse-time score with the gradient of the log-probability predicted by a probabilistic classifier. In practice, this classifier is usually obtained by minimizing an empirical loss function. While existing statistical theory guarantees good generalization performance when the sample size is sufficiently large, it remains unclear whether such training yields an effective guidance mechanism. We study this question in the context of the cross-entropy loss, which is widely used for classifier training. Under mild smoothness assumptions on the classifier, we show that controlling the cross-entropy at each diffusion step is sufficient to control the corresponding guidance error. In particular, probabilistic classifiers achieving conditional KL divergence $\varepsilon^2$ induce guidance vectors with mean squared error $\widetilde O(d\varepsilon)$, where $\widetilde O$ hides constant and logarithmic factors. Our result yields an upper bound on the sampling error of classifier-guided diffusion models and resembles a reverse log-Sobolev-type inequality. To the best of our knowledge, this is the first result that quantitatively links classifier training to guidance alignment in diffusion models, providing both a theoretical explanation for the empirical success of classifier guidance and principled guidelines for selecting classifiers that induce effective guidance.
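For concreteness, the guidance mechanism can be written via the Bayes decomposition of the conditional score; the notation below is illustrative (the symbols $s_\theta$ and $\hat p_\phi$ for the learned score network and the trained classifier are ours, not taken from the abstract):
\[
\nabla_x \log p_t(x \mid y)
\;=\;
\underbrace{\nabla_x \log p_t(x)}_{\text{unconditional score}}
\;+\;
\underbrace{\nabla_x \log p_t(y \mid x)}_{\text{guidance vector}}
\;\approx\;
s_\theta(x, t) \;+\; \nabla_x \log \hat p_\phi(y \mid x, t),
\]
so the guidance error in question is the mean squared deviation $\mathbb{E}\,\bigl\|\nabla_x \log p_t(y \mid x) - \nabla_x \log \hat p_\phi(y \mid x, t)\bigr\|^2$, which the stated result bounds by $\widetilde O(d\varepsilon)$ whenever the classifier's conditional KL divergence at step $t$ is at most $\varepsilon^2$.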