Distance-based supervised method, the minimal learning machine, constructs a predictive model from data by learning a mapping between input and output distance matrices. In this paper, we propose new methods and evaluate how their core component, the distance mapping, can be adapted to multi-label learning. The proposed approach is based on combining the distance mapping with an inverse distance weighting. Although the proposal is one of the simplest methods in the multi-label learning literature, it achieves state-of-the-art performance for small to moderate-sized multi-label learning problems. In addition to its simplicity, the proposed method is fully deterministic: Its hyper-parameter can be selected via ranking loss-based statistic which has a closed form, thus avoiding conventional cross-validation-based hyper-parameter tuning. In addition, due to its simple linear distance mapping-based construction, we demonstrate that the proposed method can assess the uncertainty of the predictions for multi-label classification, which is a valuable capability for data-centric machine learning pipelines.
翻译:基于距离的监督方法——最小学习机,通过学习输入与输出距离矩阵之间的映射关系,从数据中构建预测模型。本文提出新方法并评估其核心组件(距离映射)如何适配多标签学习场景。所提方法基于距离映射与逆距离加权的结合。尽管该方案是多标签学习文献中最简单的方法之一,但在中小规模多标签学习问题上达到了最先进的性能。除简洁性外,该方法完全确定性:其超参数可通过基于排序损失的闭式统计量进行选择,从而避免传统的基于交叉验证的超参数调优。此外,基于其简单的线性距离映射结构,我们证明了所提方法能够评估多标签分类预测的不确定性,这对以数据为中心的机器学习流程具有重要价值。