Confusion-Aware Transfer Teacher Curriculum Learning Framework: Disentangling Scoring and Pacing Effects

Savini Kommalage,Sanka Mohottala,Asiri Gawesha,Dulara Madhusanka,Menan Velayuthan,Dharshana Kasthurirathna,Mahima Milinda Alwis Weerasinghe,Charith Abhayaratne

from arxiv, Accepted at International Conference on Machine Learning (ICML) GlobalSouthML Workshop (2026)

Curriculum learning couples two design choices, how samples are scored by difficulty and how harder samples are paced into training, making it difficult to attribute observed gains to either component. We disentangle these factors with two evaluation protocols: stage-wise test subsets that validate scoring functions independently of curriculum training, and a baseline that applies the same pacing schedule to randomly ordered data. Within the Transfer Teacher framework (TTF), we use these protocols to evaluate a confusion-aware difficulty score that considers both correct-class confidence and the probability distribution over incorrect classes. On CIFAR-10 with ResNet-18 and VGG-16, the proposed score produces model-interpretable difficulty rankings that align with human intuition. However, at full data, neither curriculum nor anti-curriculum ordering improves accuracy over standard training, indicating that improving the scoring function alone is insufficient to overcome the known failure modes of curriculum learning in TTF. In contrast, We find that confusion-aware curriculum ordering result in consistent data-efficiency benefits, outperforming random ordering by up to 8.7% points at the 20% data regime, suggesting the potential of TTF as a data-efficient training method.

翻译：摘要：课程学习将两个设计选择耦合在一起：即如何根据难度对样本进行评分，以及如何将更难的样本逐步引入训练，这使得难以将观察到的性能提升归因于任一组件。我们通过两种评估协议解耦这些因素：阶段式测试子集（独立于课程训练验证评分函数），以及一种基线方法（对随机排序的数据应用相同的节奏调度）。在传递教师框架（TTF）中，我们利用这些协议评估一种困惑感知难度评分，该评分同时考虑正确类别的置信度以及错误类别的概率分布。在CIFAR-10数据集上使用ResNet-18和VGG-16网络时，所提出的评分能够生成模型可解释的难度排序结果，且与人类直觉一致。然而，在全量数据条件下，无论是课程排序还是反课程排序均未比标准训练提升准确率，这表明仅改进评分函数不足以克服TTF中课程学习已知的失效模式。相反，我们发现困惑感知课程排序在数据效率方面具有持续优势，在20%数据量场景下相比随机排序提升高达8.7个百分点，揭示了TTF作为数据高效训练方法的潜力。

相关内容

课程

关注 6

课程是指学校学生所应学习的学科总和及其进程与安排。课程是对教育的目标、教学内容、教学活动方式的规划和设计，是教学计划、教学大纲等诸多方面实施过程的总和。广义的课程是指学校为实现培养目标而选择的教育内容及其进程的总和，它包括学校老师所教授的各门学科和有目的、有计划的教育活动。狭义的课程是指某一门学科。专知上对国内外最新AI+X的课程进行了收集与索引，涵盖斯坦福大学、CMU、MIT、清华、北大等名校开放课程。

【博士论文】知识引导的序列决策算法：图结构、专家示教、人类及跨智能体经验的融合研究

专知会员服务

20+阅读 · 4月10日

TPAMI 2021 | 首篇《课程学习》综述，清华大学朱文武团队

专知会员服务

24+阅读 · 2021年7月19日

首篇「课程学习（Curriculum Learning)」2021综述论文

专知会员服务

50+阅读 · 2021年1月31日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日