Clustered Embedding Learning for Recommender Systems

In recent years, recommender systems have advanced rapidly, where embedding learning for users and items plays a critical role. A standard method learns a unique embedding vector for each user and item. However, such a method has two important limitations in real-world applications: 1) it is hard to learn embeddings that generalize well for users and items with rare interactions on their own; and 2) it may incur unbearably high memory costs when the number of users and items scales up. Existing approaches either can only address one of the limitations or have flawed overall performances. In this paper, we propose Clustered Embedding Learning (CEL) as an integrated solution to these two problems. CEL is a plug-and-play embedding learning framework that can be combined with any differentiable feature interaction model. It is capable of achieving improved performance, especially for cold users and items, with reduced memory cost. CEL enables automatic and dynamic clustering of users and items in a top-down fashion, where clustered entities jointly learn a shared embedding. The accelerated version of CEL has an optimal time complexity, which supports efficient online updates. Theoretically, we prove the identifiability and the existence of a unique optimal number of clusters for CEL in the context of nonnegative matrix factorization. Empirically, we validate the effectiveness of CEL on three public datasets and one business dataset, showing its consistently superior performance against current state-of-the-art methods. In particular, when incorporating CEL into the business model, it brings an improvement of $+0.6\%$ in AUC, which translates into a significant revenue gain; meanwhile, the size of the embedding table gets $2650$ times smaller.

翻译：近年来，推荐系统发展迅速，其中用户与物品的嵌入学习扮演着关键角色。标准方法为每个用户和物品学习一个唯一的嵌入向量。然而，这种方法在实际应用中存在两个重要局限：1）对于交互稀疏的用户和物品，难以学习到泛化性能良好的嵌入；2）当用户和物品数量规模扩大时，可能产生难以承受的高内存成本。现有方法要么只能解决其中一个局限，要么整体性能存在缺陷。本文提出聚类嵌入学习（CEL）作为针对这两个问题的集成解决方案。CEL是一种即插即用的嵌入学习框架，可与任意可微特征交互模型结合使用。它能够实现改进的性能，尤其针对冷用户和冷物品，同时降低内存成本。CEL采用自上而下的方式实现用户和物品的自动动态聚类，聚类后的实体共同学习共享嵌入。加速版CEL具有最优时间复杂度，支持高效的在线更新。理论上，我们在非负矩阵分解背景下证明了CEL的可辨识性及唯一最优聚类数的存在性。实证方面，我们在三个公开数据集和一个商业数据集上验证了CEL的有效性，展示了其相对于当前最先进方法持续优越的性能。特别地，将CEL融入商业模型后，AUC指标提升+0.6%，转化为显著的收益增长；同时，嵌入表规模缩小了2650倍。