Real-world visual search systems are deployed on multiple platforms with different computing and storage resources. Deploying a single model sized for the most constrained platform limits accuracy. It is therefore desirable to deploy models with different capacities, each adapted to the resource constraints of its platform, which requires the features extracted by these models to be aligned in the metric space. The method used to achieve this feature alignment is called ``compatible learning''. Existing research mainly focuses on the one-to-one compatible paradigm, which is limited in learning compatibility among multiple models. We propose a Switchable representation learning Framework with Self-Compatibility (SFSC). SFSC generates a series of compatible sub-models with different capacities through a single training process. Optimizing the sub-models jointly leads to gradient conflicts, and we mitigate this problem in terms of both gradient magnitude and gradient direction. For magnitude, we dynamically adjust the priorities of the sub-models through uncertainty estimation so that they are co-optimized properly. For direction, gradients that conflict are projected to avoid mutual interference. SFSC achieves state-of-the-art performance on the evaluated datasets.
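
The direction-side idea, projecting gradients that conflict, can be illustrated with a minimal sketch. The abstract does not specify the projection rule, so the code below assumes a generic PCGrad-style scheme: two gradients conflict when their inner product is negative, and the conflicting component is removed by projecting onto the plane orthogonal to the other gradient. The function name `project_conflicting` and the toy gradients `g_small` and `g_large` are hypothetical and are not taken from SFSC.

```python
import numpy as np


def project_conflicting(grads):
    """Remove from each gradient the component that opposes any other gradient.

    Assumption: a PCGrad-style rule, not necessarily the rule used by SFSC.
    Each gradient is compared against the ORIGINAL versions of the others.
    """
    grads = [np.asarray(g, dtype=float) for g in grads]
    projected = []
    for i, g_i in enumerate(grads):
        g = g_i.copy()
        for j, g_j in enumerate(grads):
            if i == j:
                continue
            dot = float(g @ g_j)
            norm_sq = float(g_j @ g_j)
            # A negative inner product means the two directions conflict.
            if dot < 0.0 and norm_sq > 0.0:
                # Subtract the projection of g onto g_j.
                g -= (dot / norm_sq) * g_j
        projected.append(g)
    return projected


# Toy gradients of a shared parameter from two hypothetical sub-models.
g_small = np.array([1.0, 0.0])
g_large = np.array([-1.0, 1.0])
p_small, p_large = project_conflicting([g_small, g_large])
```

After projection, neither gradient has a component opposing the other's original direction, so a summed update no longer cancels along the conflicting axis. The magnitude side (uncertainty-based priorities) would additionally reweight each sub-model's loss before this step; that part is omitted here because the abstract gives no formula for it.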