Long-tailed classification poses a challenge due to its heavy imbalance in class probabilities and tail-sensitivity risks with asymmetric misprediction costs. Recent attempts have used re-balancing loss and ensemble methods, but they are largely heuristic and depend heavily on empirical results, lacking theoretical explanation. Furthermore, existing methods overlook the decision loss, which characterizes different costs associated with tailed classes. This paper presents a general and principled framework from a Bayesian-decision-theory perspective, which unifies existing techniques including re-balancing and ensemble methods, and provides theoretical justifications for their effectiveness. From this perspective, we derive a novel objective based on the integrated risk and a Bayesian deep-ensemble approach to improve the accuracy of all classes, especially the ``tail". Besides, our framework allows for task-adaptive decision loss which provides provably optimal decisions in varying task scenarios, along with the capability to quantify uncertainty. Finally, We conduct comprehensive experiments, including standard classification, tail-sensitive classification with a new False Head Rate metric, calibration, and ablation studies. Our framework significantly improves the current SOTA even on large-scale real-world datasets like ImageNet.
翻译:长尾分类因其类别概率的严重不平衡以及不对称误分类代价导致的尾部敏感风险而面临挑战。近期尝试使用重平衡损失与集成方法,但这些方法大多基于启发式思路并严重依赖实证结果,缺乏理论解释。此外,现有方法忽视了刻画尾部类别不同代价的决策损失。本文从贝叶斯决策理论视角提出一个通用且原则化的框架,该框架统一了包括重平衡与集成方法在内的现有技术,并为其有效性提供了理论依据。基于此视角,我们推导出一个基于集成风险的新目标函数,并设计了一种贝叶斯深度集成方法来提升所有类别的准确率,尤其是"尾部"类别。此外,我们的框架支持任务自适应决策损失,可在不同任务场景下提供可证明的最优决策,并具备量化不确定性的能力。最后,我们开展了全面实验,包括标准分类、基于新"假头部率"指标的尾部敏感分类、校准及消融研究。即使在大规模真实数据集(如ImageNet)上,我们的框架也显著超越了当前最优水平。