Energy-Based Models (EBMs) offer a versatile framework for modeling complex data distributions. However, training and sampling from EBMs continue to pose significant challenges. The widely-used Denoising Score Matching (DSM) method for scalable EBM training suffers from inconsistency issues, causing the energy model to learn a `noisy' data distribution. In this work, we propose an efficient sampling framework: (pseudo)-Gibbs sampling with moment matching, which enables effective sampling from the underlying clean model when given a `noisy' model that has been well-trained via DSM. We explore the benefits of our approach compared to related methods and demonstrate how to scale the method to high-dimensional datasets.
翻译:能量基模型(EBMs)为复杂数据分布建模提供了通用框架,但其训练与采样过程仍面临重大挑战。广泛用于可扩展EBM训练的去噪分数匹配(DSM)方法存在不一致性问题,导致能量模型学习到“含噪”数据分布。本文提出一种高效采样框架:基于矩匹配的(伪)吉布斯采样,该框架可在给定通过DSM充分训练的“含噪”模型时,有效从底层干净模型中采样。我们探讨了本方法相较于相关方法的优势,并展示了如何将该方法扩展到高维数据集。