Independent Component Alignment for Multi-Task Learning

In a multi-task learning (MTL) setting, a single model is trained to tackle a diverse set of tasks jointly. Despite rapid progress in the field, MTL remains challenging due to optimization issues such as conflicting and dominating gradients. In this work, we propose using a condition number of a linear system of gradients as a stability criterion of an MTL optimization. We theoretically demonstrate that a condition number reflects the aforementioned optimization issues. Accordingly, we present Aligned-MTL, a novel MTL optimization approach based on the proposed criterion, that eliminates instability in the training process by aligning the orthogonal components of the linear system of gradients. While many recent MTL approaches guarantee convergence to a minimum, task trade-offs cannot be specified in advance. In contrast, Aligned-MTL provably converges to an optimal point with pre-defined task-specific weights, which provides more control over the optimization result. Through experiments, we show that the proposed approach consistently improves performance on a diverse set of MTL benchmarks, including semantic and instance segmentation, depth estimation, surface normal estimation, and reinforcement learning. The source code is publicly available at https://github.com/SamsungLabs/MTL .

翻译：在多任务学习场景中，单一模型被训练以联合处理 diverse 任务集合。尽管该领域发展迅速，但由梯度冲突和梯度主导等优化难题引发的挑战依然存在。本文提出以梯度线性系统的条件数作为多任务学习优化的稳定性判据，并从理论上论证条件数能够反映上述优化问题。基于该判据，我们提出 Aligned-MTL——一种新颖的多任务学习优化方法，通过对齐梯度线性系统的正交分量消除训练过程中的不稳定性。近期诸多多任务学习方法虽能保证收敛到极小值点，但无法预先指定任务权衡关系。相比之下，Aligned-MTL 可证明收敛到具有预定义任务特定权重的优化点，从而为优化结果提供更强控制能力。实验表明，所提方法在语义分割、实例分割、深度估计、表面法向估计及强化学习等多任务学习基准测试中持续提升性能。源代码已开源至 https://github.com/SamsungLabs/MTL。

相关内容

多任务学习

关注 162

多任务学习（MTL）是机器学习的一个子领域，可以同时解决多个学习任务，同时利用各个任务之间的共性和差异。与单独训练模型相比，这可以提高特定任务模型的学习效率和预测准确性。多任务学习是归纳传递的一种方法，它通过将相关任务的训练信号中包含的域信息用作归纳偏差来提高泛化能力。通过使用共享表示形式并行学习任务来实现,每个任务所学的知识可以帮助更好地学习其它任务。

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日