Task Relation-aware Continual User Representation Learning

User modeling, which learns to represent users into a low-dimensional representation space based on their past behaviors, got a surge of interest from the industry for providing personalized services to users. Previous efforts in user modeling mainly focus on learning a task-specific user representation that is designed for a single task. However, since learning task-specific user representations for every task is infeasible, recent studies introduce the concept of universal user representation, which is a more generalized representation of a user that is relevant to a variety of tasks. Despite their effectiveness, existing approaches for learning universal user representations are impractical in real-world applications due to the data requirement, catastrophic forgetting and the limited learning capability for continually added tasks. In this paper, we propose a novel continual user representation learning method, called TERACON, whose learning capability is not limited as the number of learned tasks increases while capturing the relationship between the tasks. The main idea is to introduce an embedding for each task, i.e., task embedding, which is utilized to generate task-specific soft masks that not only allow the entire model parameters to be updated until the end of training sequence, but also facilitate the relationship between the tasks to be captured. Moreover, we introduce a novel knowledge retention module with pseudo-labeling strategy that successfully alleviates the long-standing problem of continual learning, i.e., catastrophic forgetting. Extensive experiments on public and proprietary real-world datasets demonstrate the superiority and practicality of TERACON. Our code is available at https://github.com/Sein-Kim/TERACON.

翻译：用户建模通过学习用户历史行为将其映射到低维表征空间，为提供个性化服务在工业界引起了广泛关注。现有用户建模方法主要聚焦于为单一任务学习任务特定的用户表征。然而，由于为每个任务学习任务特定用户表征并不可行，近期研究引入了通用用户表征的概念——更通用的用户表征可适用于多种任务。尽管现有通用用户表征学习方法具有有效性，但在实际应用中由于数据需求、灾难性遗忘以及对持续新增任务的学习能力有限等问题而难以落地。本文提出一种名为TERACON的新型持续用户表征学习方法，该方法在学习能力随任务数量增加不受限的同时，能够捕捉任务间关系。核心思想是为每个任务引入任务嵌入（task embedding），用于生成任务特定的软掩码，这不仅使整个模型参数在训练序列结束时仍能持续更新，还促进了任务间关系的捕获。此外，我们引入基于伪标签策略的新型知识保留模块，成功缓解了持续学习中长期存在的灾难性遗忘问题。在公开及专有真实世界数据集上的大量实验证明了TERACON的优越性和实用性。我们的代码已开源：https://github.com/Sein-Kim/TERACON。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日