This paper addresses the problem of rolling shutter correction in complex nonlinear and dynamic scenes with extreme occlusion. Existing methods suffer from two main drawbacks. Firstly, they face challenges in estimating the accurate correction field due to the uniform velocity assumption, leading to significant image correction errors under complex motion. Secondly, the drastic occlusion in dynamic scenes prevents current solutions from achieving better image quality because of the inherent difficulties in aligning and aggregating multiple frames. To tackle these challenges, we model the curvilinear trajectory of pixels analytically and propose a geometry-based Quadratic Rolling Shutter (QRS) motion solver, which precisely estimates the high-order correction field of individual pixels. Besides, to reconstruct high-quality occlusion frames in dynamic scenes, we present a 3D video architecture that effectively Aligns and Aggregates multi-frame context, namely, RSA2-Net. We evaluate our method across a broad range of cameras and video sequences, demonstrating its significant superiority. Specifically, our method surpasses the state-of-the-art by +4.98, +0.77, and +4.33 of PSNR on Carla-RS, Fastec-RS, and BS-RSC datasets, respectively. Code is available at https://github.com/DelinQu/qrsc.
翻译:本文针对存在极端遮挡的复杂非线性和动态场景中的卷帘快门校正问题展开研究。现有方法存在两大局限:其一,由于匀速运动假设难以准确估计校正场,导致复杂运动场景下图像校正误差显著;其二,动态场景中的剧烈遮挡使得现有方案因多帧对齐与聚合的固有问题而难以获得更优的图像质量。为攻克这些挑战,我们通过解析建模像素的曲线运动轨迹,提出基于几何的二次卷帘快门(QRS)运动求解器,可精确估计单像素的高阶校正场。此外,为重建动态场景中的高质量遮挡帧,我们构建了名为RSA2-Net的3D视频架构,能有效对齐并聚合多帧上下文信息。在涵盖多种相机与视频序列的广泛评估中,本方法展现出显著优越性。具体而言,在Carla-RS、Fastec-RS和BS-RSC数据集上,我们的方法相较现有最优技术PSNR指标分别提升+4.98、+0.77和+4.33。代码已开源至https://github.com/DelinQu/qrsc。