Learning to Rank (LTR) methods are vital in online economies, affecting users and item providers. Fairness in LTR models is crucial to allocate exposure proportionally to item relevance. The deterministic ranking model can lead to unfair exposure distribution when items with the same relevance receive slightly different scores. Stochastic LTR models, incorporating the Plackett-Luce (PL) model, address fairness issues but have limitations in computational cost and performance guarantees. To overcome these limitations, we propose FairLTR-RC, a novel post-hoc model-agnostic method. FairLTR-RC leverages a pretrained scoring function to create a stochastic LTR model, eliminating the need for expensive training. Furthermore, FairLTR-RC provides finite-sample guarantees on a user-specified utility using distribution-free risk control framework. By additionally incorporating the Thresholded PL (TPL) model, we are able to achieve an effective trade-off between utility and fairness. Experimental results on several benchmark datasets demonstrate that FairLTR-RC significantly improves fairness in widely-used deterministic LTR models while guaranteeing a specified level of utility.
翻译:排序学习(LTR)方法在在线经济中至关重要,影响着用户和物品提供方。LTR模型中的公平性对于根据物品相关性按比例分配曝光度至关重要。当具有相同相关性的物品获得略微不同的分数时,确定性排序模型可能导致不公平的曝光分布。融合Plackett-Luce(PL)模型的随机LTR模型能解决公平性问题,但在计算成本和性能保证方面存在局限性。为克服这些局限性,我们提出FairLTR-RC——一种新颖的事后模型无关方法。FairLTR-RC利用预训练评分函数构建随机LTR模型,无需昂贵的重新训练。此外,FairLTR-RC通过无分布风险控制框架,在用户指定效用上提供有限样本保证。通过额外引入阈值化PL(TPL)模型,我们能够实现效用与公平性之间的有效权衡。在多个基准数据集上的实验结果表明,FairLTR-RC在保证指定效用水平的同时,显著提升了广泛使用的确定性LTR模型的公平性。