Generative retrieval (GR) maps queries directly to document identifiers (docids) using parametric knowledge, However, this design makes corpus expansion costly: adding new documents requires updating model parameters to encode new document-docid associations incurs repeated training and catastrophic forgetting of previously indexed documents. In this work, we revisit incremental GR as an in-context retrieval problem, where newly added documents are supplied as inference-time document-docid evidence. We propose ICICLE, an in-context indexing framework that performs source-aware docid generation over both parametric memory and context-provided document-docid pairs. ICICLE combines a `[COPY]`-based routing mechanism, preference-based calibration, and large context adaptation to distinguish context-grounded retrieval from parametric retrieval. Experiments on MS MARCO and NQ320K show that ICICLE improves retrieval of newly introduced documents while preserving seen-document retention without corpus-specific retraining. Our analysis further shows that high-shot degradation is mainly caused by routing failure, highlighting source-selection calibration as a key bottleneck for scaling in-context generative retrieval.
翻译:摘要:生成式检索(GR)利用参数化知识将查询直接映射至文档标识符(docid)。然而,这一设计使得语料库扩展代价高昂:添加新文档需更新模型参数以编码新的文档-docid关联,从而导致重复训练以及先前索引文档的灾难性遗忘。本研究将增量式GR重新定义为上下文检索问题,其中新增文档作为推理时的文档-docid证据提供。我们提出ICICLE框架,这是一种上下文索引框架,能够在参数化记忆与上下文提供的文档-docid对上进行源感知的docid生成。ICICLE结合了基于`[COPY]`的路由机制、偏好校准以及大上下文自适应,以区分基于上下文的检索与参数化检索。在MS MARCO和NQ320K上的实验表明,ICICLE提升了新引入文档的检索效果,同时无需针对语料库的重新训练即可保持对已见文档的保留能力。进一步分析显示,高样本退化主要由路由失效导致,这凸显了源选择校准作为扩展上下文生成式检索规模的关键瓶颈。