With the rapid advancements in autonomous driving and robot navigation, there is a growing demand for lifelong learning models capable of estimating metric (absolute) depth. Lifelong learning approaches potentially offer significant cost savings in terms of model training, data storage, and collection. However, the quality of RGB images and depth maps is sensor-dependent, and depth maps in the real world exhibit domain-specific characteristics, leading to variations in depth ranges. These challenges limit existing methods to lifelong learning scenarios with small domain gaps and relative depth map estimation. To facilitate lifelong metric depth learning, we identify three crucial technical challenges that require attention: i) developing a model capable of addressing the depth scale variation through scale-aware depth learning, ii) devising an effective learning strategy to handle significant domain gaps, and iii) creating an automated solution for domain-aware depth inference in practical applications. Based on the aforementioned considerations, in this paper, we present i) a lightweight multi-head framework that effectively tackles the depth scale imbalance, ii) an uncertainty-aware lifelong learning solution that adeptly handles significant domain gaps, and iii) an online domain-specific predictor selection method for real-time inference. Through extensive numerical studies, we show that the proposed method can achieve good efficiency, stability, and plasticity, leading the benchmarks by 8% to 15%.
翻译:摘要:随着自动驾驶和机器人导航技术的快速发展,对能够估计度量(绝对)深度的终身学习模型的需求日益增长。终身学习方法在模型训练、数据存储和采集方面具有显著的潜在成本优势。然而,RGB图像和深度图的质量依赖于传感器,且现实世界中的深度图呈现领域特定特征,导致深度范围存在差异。这些挑战将现有方法局限于领域间隙较小且仅需估计相对深度图的终身学习场景。为促进终身度量深度学习,我们识别出三个需要关注的关键技术挑战:i) 开发能够通过尺度感知深度学习解决深度尺度变化的模型;ii) 设计有效的学习策略以处理显著领域间隙;iii) 在实际应用中创建领域感知深度推理的自动化解决方案。基于上述考虑,本文提出:i) 轻量级多头框架,有效应对深度尺度不平衡问题;ii) 不确定性感知的终身学习解决方案,巧妙处理显著领域间隙;iii) 用于实时推理的在线领域特定预测器选择方法。通过大量数值实验表明,所提方法在效率、稳定性和可塑性方面表现优异,将基准性能提升8%至15%。