Mitigating Preference Leakage via Strict Estimator Separation for Normative Generative Ranking

In Generative Information Retrieval (GenIR), the bottleneck has shifted from generation to the selection of candidates, particularly for normative criteria such as cultural relevance. Current LLM-as-a-Judge evaluations often suffer from circularity and preference leakage, where overlapping supervision and evaluation models inflate performance. We address this by formalising cultural relevance as a within-query ranking task and introducing a leakage-free two-judge framework that strictly separates supervision (Judge B) from evaluation (Judge A). On a new benchmark of 33,052 (NGR-33k) culturally grounded stories, we find that while classical baselines yield only modest gains, a dense bi-encoder distilled from a Judge-B-supervised Cross-Encoder is highly effective. Although the Cross-Encoder provides a strong supervision signal for distillation, the distilled BGE-M3 model substantially outperforms it under leakage-free Judge~A evaluation. We validate our framework on the human-curated Moral Stories dataset, showing strong alignment with human norms. Our results demonstrate that rigorous evaluator separation is a prerequisite for credible GenIR evaluation, proving that subtle cultural preferences can be distilled into efficient rankers without leakage.

翻译：在生成式信息检索（GenIR）中，研究瓶颈已从生成过程转向候选结果的选择，特别是在文化相关性等规范性标准方面。当前基于大语言模型的评估方法常存在循环论证与偏好泄露问题，即监督模型与评估模型的重叠使用导致性能评估虚高。本研究通过将文化相关性形式化为查询内排序任务，并提出一种无泄露的双评估器框架来应对该问题，该框架严格分离监督功能（评估器B）与评估功能（评估器A）。基于新构建的包含33,052个文化背景故事的数据集（NGR-33k），研究发现：虽然经典基线方法仅能带来有限提升，但通过从评估器B监督的交叉编码器蒸馏得到的稠密双编码器表现出显著优势。尽管交叉编码器为蒸馏过程提供了强监督信号，但经蒸馏的BGE-M3模型在无泄露的评估器A测试中大幅超越其性能。我们在人工标注的Moral Stories数据集上验证了该框架，结果显示其与人类规范高度契合。本研究证明，严格的评估器分离是可信GenIR评估的前提条件，并证实了微妙的文化偏好能够在不泄露的情况下蒸馏至高效的排序模型中。

相关内容

排序

关注 313

排序是计算机内经常进行的一种操作，其目的是将一组“无序”的记录序列调整为“有序”的记录序列。分内部排序和外部排序。若整个排序过程不需要访问外存便能完成，则称此类排序问题为内部排序。反之，若参加排序的记录数量很大，整个序列的排序过程不可能在内存中完成，则称此类排序问题为外部排序。内部排序的过程是一个逐步扩大记录的有序序列长度的过程。

【AAAI2026】TruthfulRAG：基于知识图谱解决检索增强生成中的事实层冲突

专知会员服务

24+阅读 · 2025年11月15日

多样化偏好优化

专知会员服务

12+阅读 · 2025年2月3日

生成式信息检索综述

专知会员服务

35+阅读 · 2024年6月5日

人大最新《从匹配到生成：生成式信息检索》综述

专知会员服务

30+阅读 · 2024年4月25日