An Image Dataset for Benchmarking Recommender Systems with Raw Pixels

Recommender systems (RS) have achieved significant success by leveraging explicit identification (ID) features. However, the full potential of content features, especially the pure image pixel features, remains relatively unexplored. The limited availability of large, diverse, and content-driven image recommendation datasets has hindered the use of raw images as item representations. In this regard, we present PixelRec, a massive image-centric recommendation dataset that includes approximately 200 million user-image interactions, 30 million users, and 400,000 high-quality cover images. By providing direct access to raw image pixels, PixelRec enables recommendation models to learn item representation directly from them. To demonstrate its utility, we begin by presenting the results of several classical pure ID-based baseline models, termed IDNet, trained on PixelRec. Then, to show the effectiveness of the dataset's image features, we substitute the itemID embeddings (from IDNet) with a powerful vision encoder that represents items using their raw image pixels. This new model is dubbed PixelNet.Our findings indicate that even in standard, non-cold start recommendation settings where IDNet is recognized as highly effective, PixelNet can already perform equally well or even better than IDNet. Moreover, PixelNet has several other notable advantages over IDNet, such as being more effective in cold-start and cross-domain recommendation scenarios. These results underscore the importance of visual features in PixelRec. We believe that PixelRec can serve as a critical resource and testing ground for research on recommendation models that emphasize image pixel content. The dataset, code, and leaderboard will be available at https://github.com/website-pixelrec/PixelRec.

翻译：推荐系统（RS）通过利用显式身份（ID）特征取得了显著成功。然而，内容特征，尤其是纯图像像素特征的潜力仍相对未被充分发掘。由于缺乏大规模、多样化且以内容驱动的图像推荐数据集，原始图像作为物品表征的应用受到制约。针对这一问题，我们提出PixelRec——一个以图像为核心的大规模推荐数据集，包含约2亿条用户-图像交互记录、3000万用户及40万张高质量封面图像。通过直接提供原始图像像素，PixelRec使推荐模型能够直接学习物品表征。为展示其效用，我们首先报告了在PixelRec上训练的若干经典纯ID基线模型（称为IDNet）的实验结果。随后，通过使用强大的视觉编码器（以原始图像像素表征物品）替换IDNet中的物品ID嵌入，我们构建了新模型PixelNet。实验表明，即使在IDNet被认为高度有效的标准非冷启动推荐场景中，PixelNet已能达到与IDNet相当甚至更优的性能。此外，PixelNet在冷启动和跨域推荐场景中展现出优于IDNet的显著优势。这些结果凸显了PixelRec中视觉特征的重要性。我们相信PixelRec将成为基于图像像素内容的推荐模型研究的关键资源与测试基准。数据集、代码及排行榜将发布于https://github.com/website-pixelrec/PixelRec。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日