Sampling from complex, unnormalized probability densities is a fundamental challenge in Bayesian inference and probabilistic modeling. While Markov chain Monte Carlo (MCMC) methods provide asymptotic guarantees, they often suffer from slow mixing and high computational costs due to fixed or manually tuned trajectory lengths. In this work, we propose a novel framework that treats trajectory termination as a learnable component of the sampling dynamics. By framing MCMC within the theory of non-acyclic generative flow networks (GFlowNets), we train state-dependent neural classifiers to decide when a trajectory has reached a high-density region and should terminate. We theoretically establish the connection between optimal classifiers and the target density via detailed balance conditions and introduce a multilevel training scheme to facilitate exploration in complex geometries. Experimental results across various benchmark densities demonstrate that our approach significantly reduces average trajectory lengths while improving mode coverage and mixing compared to standard MCMC baselines.
翻译:从复杂、非归一化概率密度中进行采样是贝叶斯推断和概率建模中的基本挑战。尽管马尔可夫链蒙特卡洛方法提供了渐近保证,但由于固定或手动调优的轨迹长度,它们常面临混合缓慢和高计算成本的问题。本文提出了一种新颖框架,将轨迹终止视为采样动力学中可学习的组成部分。通过将马尔可夫链蒙特卡洛置于非循环生成流网络理论中,我们训练状态依赖的神经分类器,用于判断轨迹何时到达高密度区域并应终止。我们通过细致平衡条件从理论上建立了最优分类器与目标密度之间的联系,并引入了一种多级训练方案以促进复杂几何空间中的探索。在多个基准密度上的实验结果表明,与标准马尔可夫链蒙特卡洛基线相比,我们的方法显著减少了平均轨迹长度,同时提升了模式覆盖率和混合效率。