Conversational dense retrieval has been shown to be effective in conversational search. However, a major limitation of conversational dense retrieval is its lack of interpretability, which hinders an intuitive understanding of model behavior and makes targeted improvements difficult. This paper presents CONVINV, a simple yet effective approach to making conversational dense retrieval models interpretable. CONVINV transforms opaque conversational session embeddings into explicitly interpretable text while preserving their original retrieval performance as faithfully as possible. This transformation is achieved by training a recently proposed Vec2Text model on the ad-hoc query encoder, leveraging the fact that session and query embeddings share the same space in existing conversational dense retrieval. To further enhance interpretability, we propose incorporating external interpretable query rewrites into the transformation process. Extensive evaluations on three conversational search benchmarks demonstrate that CONVINV yields more interpretable text and preserves the original retrieval performance more faithfully than baselines. Our work connects opaque session embeddings with transparent query rewriting, paving the way toward trustworthy conversational search.
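The core idea, inverting a session embedding into text and then checking that the text retrieves the same passages, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the paper's implementation: the bag-of-words `encode` function stands in for the ad-hoc query encoder, `encode_session` stands in for a conversational session encoder sharing that space, and `invert` replaces the trained Vec2Text model with a nearest-candidate search over a small hand-written pool. The passages, turns, and candidates are invented.

```python
import numpy as np

# Toy data, invented purely for illustration (not from any benchmark).
PASSAGES = [
    "the bronze age collapse was caused by drought and invasions",
    "rome fell after centuries of decline",
    "bronze tools were cast in stone moulds",
    "the collapse of the bronze age ended many mediterranean states",
]
HISTORY = "tell me about the bronze age collapse"
CURRENT = "what caused it"
CANDIDATES = [
    "what caused the bronze age collapse",
    "what caused the fall of rome",
    "tell me about bronze tools",
]

# One shared vocabulary, so every text lands in the same embedding space.
ALL_TEXTS = PASSAGES + CANDIDATES + [HISTORY, CURRENT]
VOCAB = sorted({tok for text in ALL_TEXTS for tok in text.split()})
INDEX = {tok: i for i, tok in enumerate(VOCAB)}


def encode(text):
    """Hypothetical stand-in for the ad-hoc query encoder: normalised word counts."""
    vec = np.zeros(len(VOCAB))
    for tok in text.split():
        vec[INDEX[tok]] += 1.0
    return vec / np.linalg.norm(vec)


def encode_session(history, current):
    """Hypothetical stand-in for a session encoder that shares the query space."""
    vec = encode(history) + encode(current)
    return vec / np.linalg.norm(vec)


def invert(embedding, candidates):
    """Stand-in for Vec2Text: return the candidate text closest to the embedding."""
    scores = [float(embedding @ encode(c)) for c in candidates]
    return candidates[int(np.argmax(scores))]


def top_k(embedding, passage_matrix, k):
    """Rank passages by dot product and return the indices of the k best."""
    scores = passage_matrix @ embedding
    return [int(i) for i in np.argsort(-scores)[:k]]


passage_matrix = np.stack([encode(p) for p in PASSAGES])
session_embedding = encode_session(HISTORY, CURRENT)

# Step 1: turn the opaque session embedding into readable text.
inverted_text = invert(session_embedding, CANDIDATES)

# Step 2: re-encode the text and compare its ranking with the original one.
k = 2
top_session = top_k(session_embedding, passage_matrix, k)
top_inverted = top_k(encode(inverted_text), passage_matrix, k)
overlap = len(set(top_session) & set(top_inverted)) / k

print(inverted_text)
print(top_session, top_inverted, overlap)
```

The overlap between the two top-k lists is only one simple proxy for how faithfully the inverted text preserves retrieval behavior; standard retrieval metrics computed on both rankings would be a more thorough check.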