In this paper, we introduce the imprecise label learning (ILL) framework, a unified approach to handle various imprecise label configurations, which are commonplace challenges in machine learning tasks. ILL leverages an expectation-maximization (EM) algorithm for the maximum likelihood estimation (MLE) of the imprecise label information, treating the precise labels as latent variables. Compared to previous versatile methods attempting to infer correct labels from the imprecise label information, our ILL framework considers all possible labeling imposed by the imprecise label information, allowing a unified solution to deal with any imprecise labels. With comprehensive experimental results, we demonstrate that ILL can seamlessly adapt to various situations, including partial label learning, semi-supervised learning, noisy label learning, and a mixture of these settings. Notably, our simple method surpasses the existing techniques for handling imprecise labels, marking the first unified framework with robust and effective performance across various imprecise labels. We believe that our approach has the potential to significantly enhance the performance of machine learning models on tasks where obtaining precise labels is expensive and complicated. We hope our work will inspire further research on this topic with an open-source codebase release.
翻译:本文提出了不精确标签学习(ILL)框架,这是一种统一方法,用于处理机器学习任务中常见的各类不精确标签配置。ILL采用期望最大化(EM)算法对不精确标签信息进行极大似然估计(MLE),并将精确标签视为潜变量。与以往试图从不精确标签信息中推断正确标签的通用方法不同,我们的ILL框架考虑了不精确标签信息所施加的所有可能的标注方式,从而提供了一种处理任何不精确标签的统一解决方案。通过全面的实验结果表明,ILL能够无缝适应各种场景,包括部分标签学习、半监督学习、噪声标签学习以及这些设置的混合情况。值得注意的是,我们提出的简单方法超越了现有处理不精确标签的技术,成为首个能够在多种不精确标签下实现稳健且有效性能的统一框架。我们相信,该方法有望显著提升机器学习模型在获取精确标签成本高昂且复杂任务中的性能。我们希望通过开源代码库的发布,这一工作能够激发该方向的进一步研究。