Shadow removal and segmentation remain challenging tasks in computer vision, particularly in complex real world scenarios. This study presents a novel approach that enhances the ShadowFormer model by incorporating Masked Autoencoder (MAE) priors and Fast Fourier Convolution (FFC) blocks, leading to significantly faster convergence and improved performance. We introduce key innovations: (1) integration of MAE priors trained on Places2 dataset for better context understanding, (2) adoption of Haar wavelet features for enhanced edge detection and multiscale analysis, and (3) implementation of a modified SAM Adapter for robust shadow segmentation. Extensive experiments on the challenging DESOBA dataset demonstrate that our approach achieves state of the art results, with notable improvements in both convergence speed and shadow removal quality.
翻译:阴影去除与分割在计算机视觉中仍然是具有挑战性的任务,尤其是在复杂的现实世界场景中。本研究提出了一种新颖的方法,通过融入掩码自编码器先验和快速傅里叶卷积块来增强ShadowFormer模型,从而显著加快了收敛速度并提升了性能。我们引入了关键创新:(1) 集成在Places2数据集上训练的MAE先验以获得更好的上下文理解,(2) 采用Haar小波特征以增强边缘检测和多尺度分析,以及(3) 实现了一种改进的SAM适配器以实现鲁棒的阴影分割。在具有挑战性的DESOBA数据集上进行的大量实验表明,我们的方法取得了最先进的结果,在收敛速度和阴影去除质量方面均有显著提升。