Image ordinal regression has been mainly studied along the line of exploiting the order of categories. However, the issues of class imbalance and category overlap that are very common in ordinal regression were largely overlooked. As a result, the performance on minority categories is often unsatisfactory. In this paper, we propose a novel framework called CIG based on controllable image generation to directly tackle these two issues. Our main idea is to generate extra training samples with specific labels near category boundaries, and the sample generation is biased toward the less-represented categories. To achieve controllable image generation, we seek to separate structural and categorical information of images based on structural similarity, categorical similarity, and reconstruction constraints. We evaluate the effectiveness of our new CIG approach in three different image ordinal regression scenarios. The results demonstrate that CIG can be flexibly integrated with off-the-shelf image encoders or ordinal regression models to achieve improvement, and further, the improvement is more significant for minority categories.
翻译:图像序数回归主要沿着利用类别顺序的研究路线进行。然而,序数回归中非常常见的类别不平衡和类别重叠问题在很大程度上被忽视了,导致少数类别的性能往往不尽如人意。本文提出一种基于可控图像生成的新型框架CIG,直接解决这两个问题。我们的核心思想是在类别边界附近生成带有特定标签的额外训练样本,并且样本生成偏向于代表性不足的类别。为实现可控图像生成,我们基于结构相似性、类别相似性和重构约束,试图分离图像的结构信息和类别信息。我们在三种不同的图像序数回归场景中评估了新型CIG方法的有效性。结果表明,CIG能够灵活地与现成的图像编码器或序数回归模型集成以提升性能,并且对于少数类别的提升更为显著。