LRanker: LLM Ranker for Massive Candidates

Large language models (LLMs) have recently shown strong potential for ranking by capturing semantic relevance and adapting across diverse domains, yet existing methods remain constrained by limited context length and high computational costs, restricting their applicability to real-world scenarios where candidate pools often scale to millions. To address this challenge, we propose LRanker, a framework tailored for large-candidate ranking. LRanker incorporates a candidate aggregation encoder that leverages K-means clustering to explicitly model global candidate information, and a graph-based test-time scaling mechanism that partitions candidates into subsets, generates multiple query embeddings, and integrates them through an ensemble procedure. By aggregating diverse embeddings instead of relying on a single representation, this mechanism enhances robustness and expressiveness, leading to more accurate ranking over massive candidate pools. We evaluate LRanker on seven tasks across three scenarios in RBench with different candidate scales. Experimental results show that LRanker achieves over 30% gains in the RBench-Small scenario, improves by 3-9% in MRR in the RBench-Large scenario, and sustains scalability with 20-30% improvements in the RBench-Ultra scenario with more than 6.8M candidates. Ablation studies further verify the effectiveness of its key components. Together, these findings demonstrate the robustness, scalability, and effectiveness of LRanker for massive-candidate ranking.

翻译：大语言模型（LLM）近期通过捕捉语义相关性并在不同领域间自适应展现出了强大的排序潜力，但现有方法仍受限于有限的上下文长度和高昂的计算成本，难以应用于候选集规模可达百万级的真实场景。为应对这一挑战，我们提出LRanker——一个专为大规模候选集排序设计的框架。该框架包含候选聚合编码器（利用K-means聚类显式建模全局候选信息）和基于图的测试时扩展机制（将候选集划分为子集、生成多元查询嵌入并通过集成策略融合）。该机制通过聚合多样化嵌入而非依赖单一表征，增强了鲁棒性与表达能力，从而在海量候选池中实现更精准的排序。我们在RBench的三个场景（涵盖不同候选规模）的七项任务上评估了LRanker。实验结果表明：在RBench-Small场景中LRanker取得超30%的性能提升；在RBench-Large场景中MRR指标提升3-9%；在包含逾680万候选的RBench-Ultra场景中仍保持可扩展性，性能提升20-30%。消融实验进一步验证了其关键模块的有效性。这些发现共同证明了LRanker在海量候选集排序任务中的鲁棒性、可扩展性与有效性。

相关内容

排序

关注 313

排序是计算机内经常进行的一种操作，其目的是将一组“无序”的记录序列调整为“有序”的记录序列。分内部排序和外部排序。若整个排序过程不需要访问外存便能完成，则称此类排序问题为内部排序。反之，若参加排序的记录数量很大，整个序列的排序过程不可能在内存中完成，则称此类排序问题为外部排序。内部排序的过程是一个逐步扩大记录的有序序列长度的过程。

KDD 2026 | MixRAGRec：面向LLM推荐的混合专家KG-RAG框架

专知会员服务

12+阅读 · 5月31日

LaCache：用于高效长上下文建模的大语言模型梯状KV缓存机制

专知会员服务

11+阅读 · 2025年7月23日

142页DeepSeek-R1 思维链技术：让我们一起<思考>大语言模型（LLM）的推理能力

专知会员服务

48+阅读 · 2025年4月12日

【新书】解码大型语言模型：理解、实现与优化LLM在自然语言处理应用中的全面指南

专知会员服务

49+阅读 · 2024年12月13日