Deceptive reviews, refer to fabricated feedback designed to artificially manipulate the perceived quality of products. Within modern e-commerce ecosystems, these reviews remain a critical governance challenge. Despite advances in review-level and graph-based detection methods, two pivotal limitations remain: inadequate generalization and lack of interpretability. To address these challenges, we propose JARVIS, a framework providing Judgment via Augmented Retrieval and eVIdence graph Structures. Starting from the review to be evaluated, it retrieves semantically similar evidence via hybrid dense-sparse multimodal retrieval, expands relational signals through shared entities, and constructs a heterogeneous evidence graph. Large language model then performs evidence-grounded adjudication to produce interpretable risk assessments. Offline experiments demonstrate that JARVIS enhances performance on our constructed review dataset, achieving a precision increase from 0.953 to 0.988 and a recall boost from 0.830 to 0.901. In the production environment, our framework achieves a 27% increase in the recall volume and reduces manual inspection time by 75%. Furthermore, the adoption rate of the model-generated analysis reaches 96.4%.
翻译:虚假评论指为人为操纵产品感知质量而编造的反馈。在现代电子商务生态系统中,此类评论仍是关键治理难题。尽管评论级和图检测方法已取得进展,仍存在两个关键局限:泛化能力不足与可解释性缺失。为应对这些挑战,我们提出JARVIS框架,通过增强检索与证据图结构进行判定。该框架以待评估评论为起点,通过混合稠密-稀疏多模态检索获取语义相似的证据,借助共享实体扩展关系信号,并构建异质证据图。随后大型语言模型执行基于证据的判定,生成可解释的风险评估。离线实验表明,JARVIS在我们构建的评论数据集上提升了性能,精确率从0.953提高至0.988,召回率从0.830提升至0.901。在生产环境中,本框架实现召回量增长27%,人工审核时间减少75%。此外,模型生成分析的采纳率达到96.4%。