Improvable Gap Balancing for Multi-Task Learning

In multi-task learning (MTL), gradient balancing has recently attracted more research interest than loss balancing since it often leads to better performance. However, loss balancing is much more efficient than gradient balancing, and thus it is still worth further exploration in MTL. Note that prior studies typically ignore that there exist varying improvable gaps across multiple tasks, where the improvable gap per task is defined as the distance between the current training progress and desired final training progress. Therefore, after loss balancing, the performance imbalance still arises in many cases. In this paper, following the loss balancing framework, we propose two novel improvable gap balancing (IGB) algorithms for MTL: one takes a simple heuristic, and the other (for the first time) deploys deep reinforcement learning for MTL. Particularly, instead of directly balancing the losses in MTL, both algorithms choose to dynamically assign task weights for improvable gap balancing. Moreover, we combine IGB and gradient balancing to show the complementarity between the two types of algorithms. Extensive experiments on two benchmark datasets demonstrate that our IGB algorithms lead to the best results in MTL via loss balancing and achieve further improvements when combined with gradient balancing. Code is available at https://github.com/YanqiDai/IGB4MTL.

翻译：在多任务学习（MTL）中，梯度平衡相较于损失平衡近年来吸引了更多研究兴趣，因为其往往能带来更好的性能。然而，损失平衡的效率远高于梯度平衡，因此仍值得在MTL中进一步探索。值得注意的是，先前的研究通常忽略了多个任务之间存在不同的可改进间隙——每个任务的可改进间隙定义为当前训练进度与期望最终训练进度之间的距离。因此，在损失平衡之后，性能不平衡在许多情况下仍然会出现。本文在损失平衡框架下，提出两种新颖的MTL可改进间隙平衡（IGB）算法：一种采用简单启发式方法，另一种（首次）将深度强化学习应用于MTL。特别地，两种算法均不直接平衡MTL中的损失，而是选择动态分配任务权重以实现可改进间隙平衡。此外，我们将IGB与梯度平衡相结合，展示了这两类算法之间的互补性。在两个基准数据集上的大量实验表明，我们的IGB算法通过损失平衡在MTL中取得了最佳结果，并且与梯度平衡结合时实现了进一步性能提升。代码开源地址：https://github.com/YanqiDai/IGB4MTL。

相关内容

多任务学习

关注 162

多任务学习（MTL）是机器学习的一个子领域，可以同时解决多个学习任务，同时利用各个任务之间的共性和差异。与单独训练模型相比，这可以提高特定任务模型的学习效率和预测准确性。多任务学习是归纳传递的一种方法，它通过将相关任务的训练信号中包含的域信息用作归纳偏差来提高泛化能力。通过使用共享表示形式并行学习任务来实现,每个任务所学的知识可以帮助更好地学习其它任务。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日