Neural Information Retrieval (NIR) has significantly improved upon heuristic-based IR systems. Yet, failures remain frequent, the models used often being unable to retrieve documents relevant to the user's query. We address this challenge by proposing a lightweight abstention mechanism tailored for real-world constraints, with particular emphasis placed on the reranking phase. We introduce a protocol for evaluating abstention strategies in a black-box scenario, demonstrating their efficacy, and propose a simple yet effective data-driven mechanism. We provide open-source code for experiment replication and abstention implementation, fostering wider adoption and application in diverse contexts.
翻译:神经信息检索已显著改善了基于启发式规则的信息检索系统。然而,检索失败的情况仍频繁发生,所使用的模型往往无法检索到与用户查询相关的文档。针对这一挑战,我们提出一种轻量级的弃权机制,该机制专为实际约束条件设计,尤其关注重排序阶段。我们引入了一套在黑盒场景下评估弃权策略的协议,并证明了其有效性,同时提出了一种简单而有效的数据驱动机制。此外,我们提供了用于实验复现和弃权机制实现的开源代码,以促进该方案在不同场景中的广泛应用与推广。