As the use of online platforms continues to grow across all demographics, users often express a desire to feel represented in the content. To improve representation in search results and recommendations, we introduce end-to-end diversification, ensuring that diverse content flows throughout the various stages of these systems, from retrieval to ranking. We develop, experiment, and deploy scalable diversification mechanisms in multiple production surfaces on the Pinterest platform, including Search, Related Products, and New User Homefeed, to improve the representation of different skin tones in beauty and fashion content. Diversification in production systems includes three components: identifying requests that will trigger diversification, ensuring diverse content is retrieved from the large content corpus during the retrieval stage, and finally, balancing the diversity-utility trade-off in a self-adjusting manner in the ranking stage. Our approaches, which evolved from using Strong-OR logical operator to bucketized retrieval at the retrieval stage and from greedy re-rankers to multi-objective optimization using determinantal point processes for the ranking stage, balances diversity and utility while enabling fast iterations and scalable expansion to diversification over multiple dimensions. Our experiments indicate that these approaches significantly improve diversity metrics, with a neutral to a positive impact on utility metrics and improved user satisfaction, both qualitatively and quantitatively, in production. An accessible PDF of this article is available at https://drive.google.com/file/d/1p5PkqC-sdtX19Y_IAjZCtiSxSEX1IP3q/view
翻译:随着在线平台在各个人群中的使用持续增长,用户通常希望自己在内容中能够得到体现。为了改善搜索结果和推荐内容中的呈现,我们引入了端到端多样化方法,确保从检索到排序的各个系统阶段都能包含多样化内容。我们在Pinterest平台的多个生产场景中(包括搜索、相关产品和新生首页信息流)开发、实验并部署了可扩展的多样化机制,以改善美妆和时尚内容中不同肤色的呈现。生产系统中的多样化包括三个组成部分:识别需要触发多样化的请求,确保在检索阶段从海量内容库中获取多样化内容,最后在排序阶段以自适应方式权衡多样性与效用性。我们的方法在检索阶段从使用强或逻辑运算符演变为分桶检索,在排序阶段从贪心重排序器演变为基于行列式点过程的多目标优化,在平衡多样性与效用性的同时,支持快速迭代和多维度多样化的可扩展扩展。实验表明,这些方法显著提升了多样性指标,对效用性指标产生中性至正向影响,并在生产环境中从定性和定量两方面改善了用户满意度。本文可访问的PDF版本见:https://drive.google.com/file/d/1p5PkqC-sdtX19Y_IAjZCtiSxSEX1IP3q/view