Real-world visual search systems involve deployments on multiple platforms with different computing and storage resources. Deploying a unified model that suits the minimal-constrain platforms leads to limited accuracy. It is expected to deploy models with different capacities adapting to the resource constraints, which requires features extracted by these models to be aligned in the metric space. The method to achieve feature alignments is called ``compatible learning''. Existing research mainly focuses on the one-to-one compatible paradigm, which is limited in learning compatibility among multiple models. We propose a \textbf{S}witchable representation learning Framework with Self-Compatibility (SFSC). SFSC generates a series of compatible sub-models with different capacities through one training process. The optimization of sub-models faces gradients conflict, and we mitigate this problem from the perspective of the magnitude and direction. We adjust the priorities of sub-models dynamically through uncertainty estimation to co-optimize sub-models properly. Besides, the gradients with conflicting directions are projected to avoid mutual interference. SFSC achieves state-of-the-art performance on the evaluated datasets.
翻译:现实世界中的视觉搜索系统涉及部署在具有不同计算和存储资源的多个平台上。部署适合最小约束平台的统一模型会导致精度受限。期望部署具有不同容量以适应资源约束的模型,这要求这些模型提取的特征在度量空间中对齐。实现特征对齐的方法称为"兼容学习"。现有研究主要集中在"一对一"兼容范式上,这在学习多个模型之间的兼容性方面存在局限性。我们提出了一个基于**自兼容**的可切换表示学习框架(SFSC)。SFSC通过一次训练过程生成一系列具有不同容量的兼容子模型。子模型的优化面临梯度冲突问题,我们从幅度和方向两个角度缓解这一问题。通过不确定性估计动态调整子模型的优先级,以合理协同优化子模型。此外,对方向冲突的梯度进行投影以避免相互干扰。SFSC在评估数据集上达到了最先进的性能。