MTLComb: multi-task learning combining regression and classification tasks for joint feature selection

Multi-task learning (MTL) is a learning paradigm that enables the simultaneous training of multiple communicating algorithms. Although MTL has been successfully applied to either regression or classification tasks alone, incorporating mixed types of tasks into a unified MTL framework remains challenging, primarily because the losses associated with different task types differ in magnitude. This challenge is particularly evident in MTL applications with joint feature selection, where it often results in biased selections. To overcome this obstacle, we propose a provable loss weighting scheme that analytically determines the optimal weights for balancing regression and classification tasks. This scheme significantly mitigates the otherwise biased feature selection. Building upon this scheme, we introduce MTLComb, an MTL algorithm and software package encompassing optimization procedures, training protocols, and hyperparameter estimation procedures. MTLComb is designed for learning shared predictors among tasks of mixed types. To showcase the efficacy of MTLComb, we test it on both simulated data and biomedical studies pertaining to sepsis and schizophrenia.
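
To make the setting concrete, the following is a minimal sketch of joint feature selection across one regression task and one classification task. It is not the MTLComb implementation. It assumes a weighted sum of a squared loss and a logistic loss with a row-wise L2,1 penalty, solved by plain proximal gradient descent. The function names `prox_l21` and `fit_mixed_mtl`, the simulated data, and the `weights` argument are all hypothetical; in particular, the weights used here are placeholders, not the analytically derived weights the abstract refers to.

```python
import numpy as np


def prox_l21(W, t):
    """Row-wise soft-thresholding: proximal operator of t * sum_j ||W[j, :]||_2."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - t / np.maximum(norms, 1e-12))
    return W * scale


def fit_mixed_mtl(Xs, ys, kinds, weights, lam, step=0.1, n_iter=500):
    """Proximal gradient descent on a weighted mixed-loss objective.

    Each column of W holds the coefficients of one task. The L2,1 penalty
    shrinks whole rows, so a feature is kept or dropped for all tasks jointly.
    """
    p = Xs[0].shape[1]
    W = np.zeros((p, len(Xs)))
    for _ in range(n_iter):
        G = np.zeros_like(W)
        for t, (X, y, kind, w) in enumerate(zip(Xs, ys, kinds, weights)):
            z = X @ W[:, t]
            if kind == "regression":
                # Gradient of (1 / 2n) * ||y - Xw||^2.
                g = X.T @ (z - y) / len(y)
            else:
                # Labels in {-1, +1}; gradient of mean log(1 + exp(-y * z)).
                g = -X.T @ (y / (1.0 + np.exp(np.clip(y * z, -50, 50)))) / len(y)
            G[:, t] = w * g  # task-specific loss weight (placeholder value)
        W = prox_l21(W - step * G, step * lam)
    return W


# Simulated data (an assumption for illustration): 3 of 10 features are
# informative and shared by both tasks.
rng = np.random.default_rng(0)
n, p = 200, 10
w_true = np.zeros(p)
w_true[:3] = [2.0, -2.0, 1.5]
X_reg = rng.standard_normal((n, p))
y_reg = X_reg @ w_true + 0.1 * rng.standard_normal(n)
X_clf = rng.standard_normal((n, p))
y_clf = np.where(X_clf @ w_true + 0.1 * rng.standard_normal(n) > 0, 1.0, -1.0)

W = fit_mixed_mtl(
    [X_reg, X_clf],
    [y_reg, y_clf],
    ["regression", "classification"],
    weights=[1.0, 1.0],
    lam=0.05,
)
row_norms = np.linalg.norm(W, axis=1)  # one norm per feature, across tasks
print(np.round(row_norms, 3))
```

Equal weights, as used above, leave any mismatch in loss magnitude uncorrected, so the sketch is a starting point for experimenting with different weightings rather than a balanced solution.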


