Multi-Task Recommendations with Reinforcement Learning

In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender System (RS) applications. However, current MTL-based recommendation models tend to disregard the session-wise patterns of user-item interactions because they are predominantly constructed based on item-wise datasets. Moreover, balancing multiple objectives has always been a challenge in this field, which is typically avoided via linear estimations in existing works. To address these issues, in this paper, we propose a Reinforcement Learning (RL) enhanced MTL framework, namely RMTL, to combine the losses of different recommendation tasks using dynamic weights. To be specific, the RMTL structure can address the two aforementioned issues by (i) constructing an MTL environment from session-wise interactions and (ii) training multi-task actor-critic network structure, which is compatible with most existing MTL-based recommendation models, and (iii) optimizing and fine-tuning the MTL loss function using the weights generated by critic networks. Experiments on two real-world public datasets demonstrate the effectiveness of RMTL with a higher AUC against state-of-the-art MTL-based recommendation models. Additionally, we evaluate and validate RMTL's compatibility and transferability across various MTL models.

翻译：近年来，多任务学习在推荐系统应用中取得了显著成功。然而，当前基于MTL的推荐模型往往忽视了用户-物品交互的会话级模式，因为这些模型主要基于逐物品数据集构建。此外，平衡多个目标一直是该领域面临的挑战，现有工作中通常通过线性估计来规避这一问题。为解决上述问题，本文提出了一种基于强化学习的增强型MTL框架——RMTL，通过动态权重融合不同推荐任务的损失函数。具体而言，RMTL结构通过以下方式解决上述两个问题：（i）从会话级交互构建MTL环境；（ii）训练与现有大多数基于MTL的推荐模型兼容的多任务演员-评论家网络结构；（iii）利用评论家网络生成的权重优化和微调MTL损失函数。在两个真实公开数据集上的实验表明，RMTL相比最先进的基于MTL的推荐模型取得了更高的AUC值，验证了其有效性。此外，我们还评估并验证了RMTL在多种MTL模型中的兼容性与可迁移性。

相关内容

多任务学习

关注 162

多任务学习（MTL）是机器学习的一个子领域，可以同时解决多个学习任务，同时利用各个任务之间的共性和差异。与单独训练模型相比，这可以提高特定任务模型的学习效率和预测准确性。多任务学习是归纳传递的一种方法，它通过将相关任务的训练信号中包含的域信息用作归纳偏差来提高泛化能力。通过使用共享表示形式并行学习任务来实现,每个任务所学的知识可以帮助更好地学习其它任务。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日