Modeling Stereo-Confidence Out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

We propose a novel stereo-confidence that can be measured externally to various stereo-matching networks, offering an alternative input modality choice of the cost volume for learning-based approaches, especially in safety-critical systems. Grounded in the foundational concepts of disparity definition and the disparity plane sweep, the proposed stereo-confidence method is built upon the idea that any shift in a stereo-image pair should be updated in a corresponding amount shift in the disparity map. Based on this idea, the proposed stereo-confidence method can be summarized in three folds. 1) Using the disparity plane sweep, multiple disparity maps can be obtained and treated as a 3-D volume (predicted disparity volume), like the cost volume is constructed. 2) One of these disparity maps serves as an anchor, allowing us to define a desirable (or ideal) disparity profile at every spatial point. 3) By comparing the desirable and predicted disparity profiles, we can quantify the level of matching ambiguity between left and right images for confidence measurement. Extensive experimental results using various stereo-matching networks and datasets demonstrate that the proposed stereo-confidence method not only shows competitive performance on its own but also consistent performance improvements when it is used as an input modality for learning-based stereo-confidence methods.

翻译：摘要：我们提出了一种新颖的立体置信度方法，该方法可在各种立体匹配网络外部进行测量，为基于学习的方法（尤其在安全关键系统中）提供了成本体积的替代输入模态选择。基于视差定义与视差平面扫描的基本概念，所提出的立体置信度方法建立在以下核心理念上：立体图像对中的任何位移，都应体现在视差图的对应位移量更新中。基于此，该方法可归纳为三个要点：1）通过视差平面扫描，可获取多幅视差图并将其视为三维体积（预测视差体积），类似于成本体积的构建方式；2）以其中一幅视差图为锚点，可在每个空间点定义期望（或理想）视差轮廓；3）通过比较期望视差轮廓与预测视差轮廓，可量化左右图像间的匹配歧义程度，从而实现置信度测量。在多种立体匹配网络与数据集上的大量实验结果表明，所提出的立体置信度方法不仅自身展现出竞争性能，而且在将其作为基于学习的立体置信度方法的输入模态时，能带来持续的性能提升。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日