Fourier Boundary Features Network with Wider Catchers for Glass Segmentation

Glass largely blurs the boundary between the real world and the reflection. The special transmittance and reflectance quality have confused the semantic tasks related to machine vision. Therefore, how to clear the boundary built by glass, and avoid over-capturing features as false positive information in deep structure, matters for constraining the segmentation of reflection surface and penetrating glass. We proposed the Fourier Boundary Features Network with Wider Catchers (FBWC), which might be the first attempt to utilize sufficiently wide horizontal shallow branches without vertical deepening for guiding the fine granularity segmentation boundary through primary glass semantic information. Specifically, we designed the Wider Coarse-Catchers (WCC) for anchoring large area segmentation and reducing excessive extraction from a structural perspective. We embed fine-grained features by Cross Transpose Attention (CTA), which is introduced to avoid the incomplete area within the boundary caused by reflection noise. For excavating glass features and balancing high-low layers context, a learnable Fourier Convolution Controller (FCC) is proposed to regulate information integration robustly. The proposed method has been validated on three different public glass segmentation datasets. Experimental results reveal that the proposed method yields better segmentation performance compared with the state-of-the-art (SOTA) methods in glass image segmentation.

翻译：玻璃在很大程度上模糊了真实世界与反射之间的边界。其特殊的透射与反射特性对机器视觉相关的语义任务造成了困扰。因此，如何廓清玻璃构筑的边界，并在深层结构中避免因过度捕捉特征而产生虚假阳性信息，对于约束反射表面与穿透性玻璃的分割至关重要。我们提出了具有更宽捕捉器的傅里叶边界特征网络（FBWC），这可能是首个尝试利用充分宽的浅层水平分支（而非垂直加深）通过初级玻璃语义信息引导细粒度边界分割的方法。具体而言，我们设计了宽域粗粒度捕捉器（WCC），用于锚定大面积分割区域并从结构层面减少过度提取。我们通过交叉转置注意力机制（CTA）嵌入细粒度特征，旨在避免由反射噪声导致的边界内部不完整区域。为挖掘玻璃特征并平衡高低层上下文，我们提出了一种可学习的傅里叶卷积控制器（FCC），以稳健地调控信息整合。该方法已在三个不同的公开玻璃分割数据集上得到验证。实验结果表明，与当前最先进（SOTA）方法相比，所提方法在玻璃图像分割中取得了更优的分割性能。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日