We consider \emph{Gibbs distributions}, which are families of probability distributions over a discrete space $\Omega$ with probability mass function of the form $\mu^\Omega_\beta(\omega) \propto e^{\beta H(\omega)}$ for $\beta$ in an interval $[\beta_{\min}, \beta_{\max}]$ and $H( \omega ) \in \{0 \} \cup [1, n]$. The \emph{partition function} is the normalization factor $Z(\beta)=\sum_{\omega \in\Omega}e^{\beta H(\omega)}$. Two important parameters of these distributions are the log partition ratio $q = \log \tfrac{Z(\beta_{\max})}{Z(\beta_{\min})}$ and the counts $c_x = |H^{-1}(x)|$. These are correlated with system parameters in a number of physical applications and sampling algorithms. Our first main result is to estimate the counts $c_x$ using roughly $\tilde O( \frac{q}{\varepsilon^2})$ samples for general Gibbs distributions and $\tilde O( \frac{n^2}{\varepsilon^2} )$ samples for integer-valued distributions (ignoring some second-order terms and parameters), and we show this is optimal up to logarithmic factors. We illustrate with improved algorithms for counting connected subgraphs, independent sets, and perfect matchings. As a key subroutine, we also develop algorithms to compute the partition function $Z$ using $\tilde O(\frac{q}{\varepsilon^2})$ samples for general Gibbs distributions and using $\tilde O(\frac{n^2}{\varepsilon^2})$ samples for integer-valued distributions.
翻译:我们考虑\emph{吉布斯分布},这是离散空间$\Omega$上的一类概率分布族,其概率质量函数形式为$\mu^\Omega_\beta(\omega) \propto e^{\beta H(\omega)}$,其中$\beta$位于区间$[\beta_{\min}, \beta_{\max}]$内,且$H( \omega ) \in \{0 \} \cup [1, n]$。\emph{配分函数}为归一化因子$Z(\beta)=\sum_{\omega \in\Omega}e^{\beta H(\omega)}$。这些分布的两个重要参数是对数配分比$q = \log \tfrac{Z(\beta_{\max})}{Z(\beta_{\min})}$和计数$c_x = |H^{-1}(x)|$。在众多物理应用和采样算法中,它们与系统参数存在相关性。我们的第一个主要结果是:对于一般吉布斯分布,利用约$\tilde O( \frac{q}{\varepsilon^2})$个样本估计计数$c_x$;对于整数值分布,利用约$\tilde O( \frac{n^2}{\varepsilon^2} )$个样本(忽略部分二阶项和参数),并证明该结果在对数因子意义下最优。我们通过改进的连通子图计数、独立集计数和完美匹配计数算法加以说明。作为关键子程序,我们还开发了计算配分函数$Z$的算法:对于一般吉布斯分布需要约$\tilde O(\frac{q}{\varepsilon^2})$个样本,对于整数值分布需要约$\tilde O(\frac{n^2}{\varepsilon^2})$个样本。