In real-world scenarios, most platforms collect both large-scale, naturally noisy implicit feedback and small-scale yet highly relevant explicit feedback. Due to the issue of data sparsity, implicit feedback is often the default choice for training recommender systems (RS), however, such data could be very noisy due to the randomness and diversity of user behaviors. For instance, a large portion of clicks may not reflect true user preferences and many purchases may result in negative reviews or returns. Fortunately, by utilizing the strengths of both types of feedback to compensate for the weaknesses of the other, we can mitigate the above issue at almost no cost. In this work, we propose an Automated Data Denoising framework, \textbf{\textit{AutoDenoise}}, for recommendation, which uses a small number of explicit data as validation set to guide the recommender training. Inspired by the generalized definition of curriculum learning (CL), AutoDenoise learns to automatically and dynamically assign the most appropriate (discrete or continuous) weights to each implicit data sample along the training process under the guidance of the validation performance. Specifically, we use a delicately designed controller network to generate the weights, combine the weights with the loss of each input data to train the recommender system, and optimize the controller with reinforcement learning to maximize the expected accuracy of the trained RS on the noise-free validation set. Thorough experiments indicate that AutoDenoise is able to boost the performance of the state-of-the-art recommendation algorithms on several public benchmark datasets.
翻译:在实际场景中,大多数平台既收集大规模但存在自然噪声的隐式反馈数据,也收集规模较小但相关性极高的显式反馈数据。由于数据稀疏性问题,隐式反馈常被作为训练推荐系统(RS)的默认选择,但这类数据可能因用户行为的随机性和多样性而包含大量噪声。例如,大部分点击行为可能并不反映用户的真实偏好,许多购买行为最终可能导致差评或退货。幸运的是,通过利用两类反馈的优势互补,我们可以在几乎零成本的情况下缓解上述问题。本文提出面向推荐的自动化数据去噪框架\textbf{\textit{AutoDenoise}},该框架利用少量显式数据作为验证集来指导推荐模型训练。受课程学习(CL)广义定义的启发,AutoDenoise能够在训练过程中根据验证性能的引导,自动动态地为每个隐式数据样本分配最合适的(离散或连续)权重。具体而言,我们设计精密的控制器网络生成权重,将权重与各输入数据的损失函数相结合以训练推荐系统,并通过强化学习优化控制器,使训练后的推荐系统在无噪声验证集上的预期准确率最大化。大量实验表明,AutoDenoise能够显著提升多个公开基准数据集上最先进推荐算法的性能。