Data imbalance and open-ended distribution are two intrinsic characteristics of the real visual world. Though encouraging progress has been made in tackling each challenge separately, few works dedicated to combining them towards real-world scenarios. While several previous works have focused on classifying close-set samples and detecting open-set samples during testing, it's still essential to be able to classify unknown subjects as human beings. In this paper, we formally define a more realistic task as distribution-agnostic generalized category discovery (DA-GCD): generating fine-grained predictions for both close- and open-set classes in a long-tailed open-world setting. To tackle the challenging problem, we propose a Self-Balanced Co-Advice contrastive framework (BaCon), which consists of a contrastive-learning branch and a pseudo-labeling branch, working collaboratively to provide interactive supervision to resolve the DA-GCD task. In particular, the contrastive-learning branch provides reliable distribution estimation to regularize the predictions of the pseudo-labeling branch, which in turn guides contrastive learning through self-balanced knowledge transfer and a proposed novel contrastive loss. We compare BaCon with state-of-the-art methods from two closely related fields: imbalanced semi-supervised learning and generalized category discovery. The effectiveness of BaCon is demonstrated with superior performance over all baselines and comprehensive analysis across various datasets. Our code is publicly available.
翻译:数据不均衡与开放分布是真实视觉世界的两个固有特征。尽管在分别应对每个挑战方面已取得令人鼓舞的进展,但很少有研究致力于将二者结合以应对现实场景。虽然已有若干工作聚焦于测试时对闭集样本进行分类并检测开集样本,但具备像人类一样对未知主体进行分类的能力仍至关重要。本文正式定义了一个更具现实意义的任务——分布无关的广义类别发现(DA-GCD):在长尾开放世界环境中对闭集与开集类别生成细粒度预测。为应对这一挑战性难题,我们提出了一种自平衡协同引导对比学习框架(BaCon),该框架由对比学习分支与伪标签分支组成,通过协同工作提供交互式监督以解决DA-GCD任务。具体而言,对比学习分支提供可靠的分布估计来正则化伪标签分支的预测结果,而伪标签分支则通过自平衡知识迁移及提出的新型对比损失函数反向指导对比学习。我们将BaCon与来自两个紧密相关领域(不均衡半监督学习与广义类别发现)的最新方法进行了对比。通过在各数据集上的全面分析与优于所有基线方法的卓越性能,BaCon的有效性得到了充分验证。我们的代码已公开。