Building on existing analyses of retrieval heads in large language models, we propose an alternative reranking framework that trains models to estimate passage-query relevance from the attention scores of selected heads. This approach provides a listwise solution that leverages holistic information across the entire candidate shortlist during ranking. At the same time, it naturally produces continuous relevance scores, enabling training on arbitrary retrieval datasets without requiring Likert-scale supervision. Our framework is lightweight and effective, requiring only small-scale models (e.g., 4B parameters) to achieve strong performance. Extensive experiments demonstrate that our method outperforms state-of-the-art pointwise and listwise rerankers across multiple domains, including Wikipedia and long narrative datasets. It also establishes a new state of the art on the LoCoMo benchmark, which assesses dialogue understanding and memory usage. Finally, we show that our framework supports flexible extensions: augmenting candidate passages with contextual information further improves ranking accuracy, while training attention heads from middle layers improves efficiency without sacrificing performance.
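To make the core idea concrete, the following is a minimal, hypothetical sketch (not the authors' implementation) of attention-based listwise scoring: given a precomputed attention tensor over a sequence that contains all candidate passages followed by the query, each passage is scored by the attention mass that query tokens assign to that passage's token span, averaged over a set of selected heads. The function and variable names (`rerank_by_attention`, `head_ids`, the span layout) are illustrative assumptions.

```python
import numpy as np

def rerank_by_attention(attn, query_span, passage_spans, head_ids):
    """Score candidate passages by query-to-passage attention mass.

    attn: array of shape (num_heads, seq_len, seq_len); row i holds the
          attention distribution of token i over all key positions.
    query_span: (start, end) token indices of the query in the sequence.
    passage_spans: list of (start, end) spans, one per candidate passage.
    head_ids: indices of the selected heads (hypothetical selection).

    Returns (order, scores): passage indices sorted by descending
    relevance, and the continuous score per passage.
    """
    qs, qe = query_span
    scores = []
    for ps, pe in passage_spans:
        # Mean attention from query tokens to this passage's tokens,
        # averaged over the selected heads -> one continuous score.
        scores.append(float(attn[head_ids, qs:qe, ps:pe].mean()))
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return order, scores

# Toy layout: passage 0 at tokens [0, 3), passage 1 at [3, 6),
# query at [6, 8); two heads, both selected.
attn = np.zeros((2, 8, 8))
attn[:, 6:8, 0:3] = 0.1   # query attends weakly to passage 0
attn[:, 6:8, 3:6] = 0.3   # query attends strongly to passage 1
order, scores = rerank_by_attention(attn, (6, 8), [(0, 3), (3, 6)], [0, 1])
```

Because every candidate sits in the same context window, the attention scores are computed jointly over the whole shortlist, which is what makes the scheme listwise rather than pointwise, while the continuous scores avoid any need for Likert-scale labels.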