Recommender systems often grapple with noisy implicit feedback. Most studies alleviate the noise issues from data cleaning perspective such as data resampling and reweighting, but they are constrained by heuristic assumptions. Another denoising avenue is from model perspective, which proactively injects noises into user-item interactions and enhances the intrinsic denoising ability of models. However, this kind of denoising process poses significant challenges to the recommender model's representation capacity to capture noise patterns. To address this issue, we propose Denoising Diffusion Recommender Model (DDRM), which leverages multi-step denoising process of diffusion models to robustify user and item embeddings from any recommender models. DDRM injects controlled Gaussian noises in the forward process and iteratively removes noises in the reverse denoising process, thereby improving embedding robustness against noisy feedback. To achieve this target, the key lies in offering appropriate guidance to steer the reverse denoising process and providing a proper starting point to start the forward-reverse process during inference. In particular, we propose a dedicated denoising module that encodes collaborative information as denoising guidance. Besides, in the inference stage, DDRM utilizes the average embeddings of users' historically liked items as the starting point rather than using pure noise since pure noise lacks personalization, which increases the difficulty of the denoising process. Extensive experiments on three datasets with three representative backend recommender models demonstrate the effectiveness of DDRM.
翻译:推荐系统常受隐式反馈中的噪声困扰。多数研究从数据清洗角度缓解噪声问题,如数据重采样与重加权,但这些方法受限于启发式假设。另一条去噪路径从模型视角出发,主动向用户-物品交互注入噪声以增强模型内在去噪能力。然而,此类去噪过程对推荐模型捕捉噪声模式的表示能力提出了重大挑战。为解决该问题,我们提出去噪扩散推荐模型(DDRM),利用扩散模型的多步去噪过程增强任意推荐模型的用户与物品嵌入鲁棒性。DDRM在前向过程中注入受控高斯噪声,并在反向去噪过程中迭代消除噪声,从而提升嵌入对噪声反馈的稳健性。实现该目标的关键在于:为反向去噪过程提供恰当引导,并在推理阶段为前向-反向过程设置合适的起始点。具体而言,我们设计了专用去噪模块,将协同信息编码为去噪引导信号。此外,在推理阶段,DDRM采用用户历史偏好物品的平均嵌入作为起始点而非纯噪声——因纯噪声缺乏个性化特征,会增加去噪过程难度。基于三个数据集与三种代表性后端推荐模型的大量实验验证了DDRM的有效性。