Finding Hall blockers by matrix scaling

For a given nonnegative matrix $A=(A_{ij})$, the matrix scaling problem asks whether $A$ can be scaled to a doubly stochastic matrix $D_1AD_2$ for some positive diagonal matrices $D_1,D_2$.The Sinkhorn algorithm is a simple iterative algorithm, which repeats row-normalization $A_{ij} \leftarrow A_{ij}/\sum_{j}A_{ij}$ and column-normalization $A_{ij} \leftarrow A_{ij}/\sum_{i}A_{ij}$ alternatively. By this algorithm, $A$ converges to a doubly stochastic matrix in limit if and only if the bipartite graph associated with $A$ has a perfect matching. This property can decide the existence of a perfect matching in a given bipartite graph $G$, which is identified with the $0,1$-matrix $A_G$.Linial, Samorodnitsky, and Wigderson showed that $O(n^2 \log n)$ iterations for $A_G$ decide whether $G$ has a perfect matching. Here $n$ is the number of vertices in one of the color classes of $G$. In this paper, we show an extension of this result:If $G$ has no perfect matching, then a polynomial number of the Sinkhorn iterations identifies a Hall blocker -- a vertex subset $X$ having neighbors $\Gamma(X)$ with $|X| > |\Gamma(X)|$. Specifically, we show that $O(n^2 \log n)$ iterations can identify one Hall blocker, and that further polynomial iterations can also identify all parametric Hall blockers $X$ of maximizing $(1-\lambda) |X| - \lambda |\Gamma(X)|$ for $\lambda \in [0,1]$.The former result is based on an interpretation of the Sinkhorn algorithm as alternating minimization for geometric programming. The latter is on an interpretation as alternating minimization for KL-divergence (Csisz\'{a}r and Tusn\'{a}dy 1984, Gietl and Reffel 2013) and its limiting behavior for a nonscalable matrix (Aas 2014). We also relate the Sinkhorn limit with parametric network flow, principal partition of polymatroids, and the Dulmage-Mendelsohn decomposition of a bipartite graph.

翻译：对于给定的非负矩阵 $A=(A_{ij})$，矩阵缩放问题考察是否存在正对角矩阵 $D_1,D_2$，使得 $A$ 可被缩放为双随机矩阵 $D_1AD_2$。Sinkhorn算法是一种简单的迭代算法，交替执行行归一化 $A_{ij} \leftarrow A_{ij}/\sum_{j}A_{ij}$ 与列归一化 $A_{ij} \leftarrow A_{ij}/\sum_{i}A_{ij}$。通过此算法，当且仅当 $A$ 关联的二部图具有完美匹配时，$A$ 在极限下收敛至双随机矩阵。该性质可用于判定给定二部图 $G$（其对应 $0,1$ 矩阵记为 $A_G$）是否存在完美匹配。Linial、Samorodnitsky与Wigderson证明，对 $A_G$ 执行 $O(n^2 \log n)$ 次迭代即可判定 $G$ 是否存在完美匹配，其中 $n$ 为 $G$ 某一色部顶点数。本文证明该结果的推广：若 $G$ 无完美匹配，则多项式次Sinkhorn迭代可识别出一个霍尔阻碍——即满足 $|X| > |\Gamma(X)|$ 的顶点子集 $X$（$\Gamma(X)$ 为其邻域）。具体而言，我们证明 $O(n^2 \log n)$ 次迭代可识别一个霍尔阻碍，进一步的多项式迭代可识别所有参数化霍尔阻碍 $X$，即最大化 $(1-\lambda) |X| - \lambda |\Gamma(X)|$（$\lambda \in [0,1]$）的子集。前者结果基于将Sinkhorn算法解释为几何规划的交替最小化，后者则基于将其解释为KL散度的交替最小化（Csiszár与Tusnády 1984，Gietl与Reffel 2013）及其对不可缩放矩阵的极限行为（Aas 2014）。我们还将Sinkhorn极限与参数化网络流、多拟阵主划分及二部图的Dulmage-Mendelsohn分解建立了联系。