Convergence of Some Convex Message Passing Algorithms to a Fixed Point

A popular approach to the MAP inference problem in graphical models is to minimize an upper bound obtained from a dual linear programming or Lagrangian relaxation by (block-)coordinate descent. This is also known as convex/convergent message passing; examples are max-sum diffusion and sequential tree-reweighted message passing (TRW-S). Convergence properties of these methods are currently not fully understood. They have been proved to converge to the set characterized by local consistency of active constraints, with unknown convergence rate; however, it was not clear if the iterates converge at all (to any point). We prove a stronger result (conjectured before but never proved): the iterates converge to a fixed point of the method. Moreover, we show that the algorithm terminates within $\mathcal{O}(1/\varepsilon)$ iterations. We first prove this for a version of coordinate descent applied to a general piecewise-affine convex objective. Then we show that several convex message passing methods are special cases of this method. Finally, we show that a slightly different version of coordinate descent can cycle.
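To make the notion of a fixed point concrete, here is a minimal sketch of exact coordinate minimization on a toy piecewise-affine convex function. It is not the method analyzed in the paper, whose exact update rule the abstract does not state. The objective `f`, the helper `argmin_1d`, and the starting point are all assumptions chosen for illustration. The sketch shows a general fact about nonsmooth convex objectives: coordinate descent can stop at a point that no single-coordinate move improves, even though that point is not a global minimizer.

```python
def f(x, y):
    # Toy piecewise-affine convex objective (an assumption for illustration).
    # Its global minimum is f(0, 0) = 0.
    return abs(x - y) + 0.5 * abs(x + y)


def argmin_1d(g, breakpoints):
    # A convex piecewise-affine function of one variable that is bounded
    # below attains its minimum at one of its kinks, so it is enough to
    # compare the values at the kinks.
    return min(breakpoints, key=g)


def coordinate_descent(x, y, sweeps=10):
    # Cyclic coordinate descent with exact one-dimensional minimization.
    for _ in range(sweeps):
        # For fixed y, the kinks of t -> f(t, y) are t = y and t = -y.
        x = argmin_1d(lambda t: f(t, y), [y, -y])
        # For fixed x, the kinks of t -> f(x, t) are t = x and t = -x.
        y = argmin_1d(lambda t: f(x, t), [x, -x])
    return x, y


x, y = coordinate_descent(3.0, 1.0)
print((x, y), f(x, y))   # the iterates stop at (1.0, 1.0) with value 1.0
print(f(0.0, 0.0))       # the global minimum value is 0.0
```

Starting from (3, 1), the iterates reach (1, 1) after one sweep and never move again, because neither coordinate alone can lower the objective there. This is why convergence to a fixed point and convergence to an optimum are separate questions for this family of methods.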

Coordinate descent

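Coordinate descent is a derivative-free optimization algorithm. In each iteration it performs a one-dimensional search along a single coordinate direction from the current point to find a local minimum of the function, cycling through the coordinate directions over the course of the run. For non-separable functions, the algorithm may not reach the optimum within a small number of iterations. To speed up convergence, one can switch to a more suitable coordinate system, for example one obtained by principal component analysis, in which the new coordinates are as uncorrelated with one another as possible.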
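The description above can be sketched as follows for a convex quadratic, where the one-dimensional minimization along each coordinate has a closed form. This is a minimal illustration under stated assumptions: the objective is f(x) = 0.5 xᵀAx − bᵀx with A symmetric positive definite, and the function name, the matrices, and the stopping tolerance are hypothetical choices, not taken from the source.

```python
def coordinate_descent_quadratic(A, b, x0, tol=1e-10, max_sweeps=10_000):
    """Cyclic coordinate descent on f(x) = 0.5 * x^T A x - b^T x.

    A is assumed symmetric positive definite. Returns the final point and
    the number of sweeps performed (including the last sweep, which only
    confirms that no coordinate moved by more than tol).
    """
    x = list(x0)
    n = len(b)
    for sweep in range(1, max_sweeps + 1):
        largest_step = 0.0
        for i in range(n):
            # Exact minimizer along coordinate i with the others held fixed:
            # set the i-th partial derivative A[i] . x - b[i] to zero.
            coupling = sum(A[i][j] * x[j] for j in range(n) if j != i)
            new_xi = (b[i] - coupling) / A[i][i]
            largest_step = max(largest_step, abs(new_xi - x[i]))
            x[i] = new_xi
        if largest_step < tol:
            return x, sweep
    return x, max_sweeps


b = [2.0, 4.0]

# Separable objective: no coupling between the two coordinates.
A_separable = [[2.0, 0.0], [0.0, 4.0]]
x_sep, sweeps_sep = coordinate_descent_quadratic(A_separable, b, [0.0, 0.0])

# Non-separable objective: strong coupling between the two coordinates.
A_coupled = [[2.0, 1.8], [1.8, 2.0]]
x_cpl, sweeps_cpl = coordinate_descent_quadratic(A_coupled, b, [0.0, 0.0])

print(sweeps_sep, sweeps_cpl)
```

In the separable case every coordinate lands on its optimal value in the first sweep, while the strongly coupled case needs many more sweeps to settle. A change of coordinates that removes the coupling, as the PCA remark suggests, would turn the second problem into the first.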
