True Self-Avoiding Walk for Accelerating Markov-Chain Monte Carlo Integration

We study true self-avoiding walk (TSAW) as a mechanism for improving empirical integral estimation via Markov chain Monte Carlo (MCMC). We consider finite-state adaptive sampling dynamics associated with an irreducible Markov kernel $P$ on a finite set, with stationary distribution $π$, in which the transition probabilities are penalized according to empirical overuse. Our main result is that the empirical occupation counts $L_t(i)$ and transition counts $N_t(i,j)$ of the resulting TSAW-based walk satisfy \[ L_t(i)-tπ_i = O(\sqrt{\log t}) \quad\text{and}\quad N_t(i,j)-tπ_iP_{ij}=O(\sqrt{\log t}) \qquad\text{almost surely} \] for every state $i$ and every edge $(i,j)$ with $P_{ij}>0$. Consequently, for every bounded function $f:V\to\mathbb R$, the error of our integral estimator converges as \[ \left|\frac1t\sum_{s=0}^{t-1} f(X_s)-\sum_{i\in V}π_i f(i)\right| = O\left(\frac{\sqrt{\log t}}{t}\right) \qquad\text{almost surely}. \] These results show that, in contrast with the usual $t^{-1/2}$ error scaling for empirical averages under standard random-walk-based methods, TSAW-based estimator yields empirical integral errors of order $O(\sqrt{\log t}/t)$ almost surely, thereby achieving a substantially sharper dependence on the sample size $t$.

翻译：我们研究了真正自回避行走（TSAW）作为一种通过马尔可夫链蒙特卡罗（MCMC）改进经验积分估计的机制。考虑在有限集上与不可约马尔可夫核 $P$ 相关的有限状态自适应采样动力学，其平稳分布为 $π$，其中转移概率根据经验过度使用而受到惩罚。我们的主要结果是，由此产生的基于TSAW的行走的经验占据计数 $L_t(i)$ 和转移计数 $N_t(i,j)$ 几乎必然地满足：对于每个状态 $i$ 和每条满足 $P_{ij}>0$ 的边 $(i,j)$，有 \[ L_t(i)-tπ_i = O(\sqrt{\log t}) \quad\text{和}\quad N_t(i,j)-tπ_iP_{ij}=O(\sqrt{\log t}). \] 因此，对于每个有界函数 $f:V\to\mathbb R$，我们的积分估计器的误差几乎必然地收敛为 \[ \left|\frac1t\sum_{s=0}^{t-1} f(X_s)-\sum_{i\in V}π_i f(i)\right| = O\left(\frac{\sqrt{\log t}}{t}\right). \] 这些结果表明，与标准随机游走方法下经验平均的通常 $t^{-1/2}$ 误差标度相比，基于TSAW的估计器几乎必然地产生阶为 $O(\sqrt{\log t}/t)$ 的经验积分误差，从而实现了对样本量 $t$ 显著更优的依赖关系。

相关内容

马尔可夫链

关注 289

马尔可夫链，因安德烈·马尔可夫（A.A.Markov，1856－1922）得名，是指数学中具有马尔可夫性质的离散事件随机过程。该过程中，在给定当前知识或信息的情况下，过去（即当前以前的历史状态）对于预测将来（即当前以后的未来状态）是无关的。在马尔可夫链的每一步，系统根据概率分布，可以从一个状态变到另一个状态，也可以保持当前状态。状态的改变叫做转移，与不同的状态改变相关的概率叫做转移概率。随机漫步就是马尔可夫链的例子。随机漫步中每一步的状态是在图形中的点，每一步可以移动到任何一个相邻的点，在这里移动到每一个点的概率都是相同的（无论之前漫步路径是如何的）。

最新，DeepSeek-R1论文登上Nature封面，附83页补充材料

专知会员服务

27+阅读 · 2025年9月18日

【微软亚研】rStar-Math：小型大语言模型通过自我进化的深度思维掌握数学推理

专知会员服务

24+阅读 · 2025年1月13日

【2023新书】马尔可夫链吉布斯场，蒙特卡罗模拟和队列，564页pdf

专知会员服务

63+阅读 · 2023年3月8日

【AI+兵棋推演】60页paper速读：美国空军兵棋推演多物网络行动路线自动分析方法，The wargame commodity course of action automated analysis method

专知会员服务

93+阅读 · 2022年3月18日