Large Language Models (LLMs) are increasingly adopted as backbones for Generative Recommendation (GR), promising access to pretrained world knowledge. Yet reliably invoking this knowledge for GR remains poorly understood. A key obstacle is that LLM-based GR typically represents items with Semantic IDs (SIDs), disrupting LLMs' natural-language reasoning interface because these tokens are unseen by the LLM during pretraining. Existing approaches address this with expensive multi-stage pipelines that ground SIDs and elicit explicit rationales, but offer limited insight into when and why each stage is necessary. In this work, we systematically decompose explicit reasoning training pipelines for LLM-based GR, revealing three key limitations: weakened world-knowledge verbalization, misalignment between SID and natural-language token embedding spaces, and sensitivity to rationale quality, all of which hurt explicit reasoning performance. To circumvent these issues, we propose PauseRec, a lightweight implicit reasoning paradigm tailored for GR. PauseRec is exceptionally practical, avoiding costly reasoning trace acquisition and reasoning alignment training, leading to a multitude of benefits: (1) it outperforms standard explicit CoT methods by up to 6.22%, (2) it reduces training cost by up to 65% GPU hours, and (3) it speeds up inference by up to 71.3%. These results position PauseRec as a lightweight alternative to explicit rationale generation, enabling more effective and efficient LLM-based GR.
翻译:大型语言模型(LLMs)正越来越多地被用作生成式推荐(GR)的骨干网络,有望利用预训练的世界知识。然而,如何可靠地调用这些知识用于GR仍不甚明了。一个关键障碍在于,基于LLM的GR通常使用语义ID(SID)来表示物品,这破坏了LLM的自然语言推理接口,因为这些标记在LLM预训练期间是未见过的。现有方法通过昂贵的多阶段流程来应对这一问题,这些流程不仅需要建立SID的映射,还要引出显式解释,但未能清晰阐明各阶段在何时及为何必要。本文系统性地拆解了基于LLM的GR的显式推理训练流程,揭示了三个关键局限:世界知识语言化能力减弱、SID与自然语言标记嵌入空间的对齐偏差,以及对解释质量的敏感性——这些问题均会损害显式推理性能。为规避这些问题,我们提出PauseRec——一种专为GR设计的轻量级隐式推理范式。PauseRec极具实用性,它避免了昂贵的推理轨迹获取和推理对齐训练,从而带来了多重优势:(1)性能比标准显式思维链方法提升高达6.22%;(2)训练成本降低多达65% GPU小时;(3)推理速度提升高达71.3%。这些结果使PauseRec成为显式解释生成的轻量级替代方案,能够实现更高效、更有效的基于LLM的GR。