DiffMatch: Diffusion Model for Dense Matching

The objective for establishing dense correspondence between paired images consists of two terms: a data term and a prior term. While conventional techniques focused on defining hand-designed prior terms, which are difficult to formulate, recent approaches have focused on learning the data term with deep neural networks without explicitly modeling the prior, assuming that the model itself has the capacity to learn an optimal prior from a large-scale dataset. The performance improvement was obvious, however, they often fail to address inherent ambiguities of matching, such as textureless regions, repetitive patterns, and large displacements. To address this, we propose DiffMatch, a novel conditional diffusion-based framework designed to explicitly model both the data and prior terms. Unlike previous approaches, this is accomplished by leveraging a conditional denoising diffusion model. DiffMatch consists of two main components: conditional denoising diffusion module and cost injection module. We stabilize the training process and reduce memory usage with a stage-wise training strategy. Furthermore, to boost performance, we introduce an inference technique that finds a better path to the accurate matching field. Our experimental results demonstrate significant performance improvements of our method over existing approaches, and the ablation studies validate our design choices along with the effectiveness of each component. Project page is available at https://ku-cvlab.github.io/DiffMatch/.

翻译：建立配对图像间密集对应的目标包含两项：数据项与先验项。传统方法侧重于定义手工设计的先验项，这难以进行公式化表述，而近期方法则通过深度神经网络学习数据项，不显式建模先验，假设模型本身具备从大规模数据集中学习最优先验的能力。尽管性能提升显著，但这类方法常无法应对匹配固有的歧义性，如无纹理区域、重复模式及大幅度位移。为解决此问题，我们提出DiffMatch——一种基于条件扩散的新型框架，旨在显式建模数据项与先验项。不同于先前方法，我们通过利用条件去噪扩散模型实现这一目标。DiffMatch包含两个核心模块：条件去噪扩散模块与代价注入模块。我们采用分阶段训练策略来稳定训练过程并降低内存消耗。此外，为提升性能，我们引入一种推理技术，能够为精确匹配场寻找更优路径。实验结果表明，我们的方法相较于现有方法取得了显著性能提升，消融研究验证了设计选择及各组件的有效性。项目主页参见https://ku-cvlab.github.io/DiffMatch/。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

因果图，Causal Graphs，52页ppt

专知会员服务

254+阅读 · 2020年4月19日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日