Sample Enrichment via Temporary Operations on Subsequences for Sequential Recommendation

Sequential recommendation leverages interaction sequences to predict forthcoming user behaviors, crucial for crafting personalized recommendations. However, the true preferences of a user are inherently complex and high-dimensional, while the observed data is merely a simplified and low-dimensional projection of the rich preferences, which often leads to prevalent issues like data sparsity and inaccurate model training. To learn true preferences from the sparse data, most existing works endeavor to introduce some extra information or design some ingenious models. Although they have shown to be effective, extra information usually increases the cost of data collection, and complex models may result in difficulty in deployment. Innovatively, we avoid the use of extra information or alterations to the model; instead, we fill the transformation space between the observed data and the underlying preferences with randomness. Specifically, we propose a novel model-agnostic and highly generic framework for sequential recommendation called sample enrichment via temporary operations on subsequences (SETO), which temporarily and separately enriches the transformation space via sequence enhancement operations with rationality constraints in training. The transformation space not only exists in the process from input samples to preferences but also in preferences to target samples. We highlight our SETO's effectiveness and versatility over multiple representative and state-of-the-art sequential recommendation models (including six single-domain sequential models and two cross-domain sequential models) across multiple real-world datasets (including three single-domain datasets, three cross-domain datasets and a large-scale industry dataset).

翻译：序列推荐利用交互序列预测用户未来行为，这对构建个性化推荐至关重要。然而，用户的真实偏好本质上是复杂且高维的，而观测数据仅是丰富偏好的简化低维投影，这常导致数据稀疏性和模型训练不准确等普遍问题。为从稀疏数据中学习真实偏好，现有研究大多致力于引入额外信息或设计精巧模型。尽管这些方法已被证明有效，但额外信息通常增加数据收集成本，复杂模型则可能导致部署困难。我们创新性地避免了使用额外信息或修改模型，而是通过随机性填充观测数据与潜在偏好之间的转换空间。具体而言，我们提出了一种新颖的模型无关且高度通用的序列推荐框架——基于子序列临时操作的样本增强（SETO），该框架在训练期间通过具有合理性约束的序列增强操作，临时且独立地丰富转换空间。转换空间不仅存在于从输入样本到偏好的过程中，也存在于偏好到目标样本的过程中。我们在多个真实数据集（包括三个单域数据集、三个跨域数据集和一个大规模工业数据集）上，通过多种代表性及最先进的序列推荐模型（包括六个单域序列模型和两个跨域序列模型），验证了SETO的有效性和通用性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/