We report on a systematic, PRISMA-guided survey of research at the intersection of LLMs and visualization, with a particular focus on visio-verbal interaction -- where verbal and visual modalities converge to support data sense-making. The emergence of Large Language Models (LLMs) has introduced new paradigms for interacting with data visualizations through natural language, leading to intuitive, multimodal, and accessible interfaces. We analyze 48 papers across six dimensions: application domain, visualization task, visualization representation, interaction modality, LLM integration, and system evaluation. Our classification framework maps LLM roles across the visualization pipeline, from data querying and transformation to visualization generation, explanation, and navigation. We highlight emerging design patterns, identify gaps in accessibility and visualization reading, and discuss the limitations of current LLMs in spatial reasoning and contextual grounding. We further reflect on evaluations of combined LLM-visualization systems, highlighting how current research projects tackle this challenge and discuss current gaps in conducting meaningful evaluations of such systems. With our survey we aim to guide future research and system design in LLM-enhanced visualization, supporting broad audiences and intelligent, conversational interfaces.
翻译:本文通过一项遵循PRISMA指南的系统性综述,探究了大型语言模型(LLM)与可视化交叉领域的研究,特别聚焦于视-语交互——即语言与视觉模态融合以支持数据感知的交互范式。大型语言模型的出现为通过自然语言与数据可视化进行交互引入了新范式,催生了直观、多模态且易于访问的界面。我们从六个维度分析了48篇论文:应用领域、可视化任务、可视化表征、交互模态、LLM集成方式以及系统评估。我们的分类框架描绘了LLM在整个可视化流程(从数据查询与转换到可视化生成、解释与导航)中所扮演的角色。我们重点阐述了新兴的设计模式,指出了在可访问性与可视化解读方面存在的不足,并讨论了当前LLM在空间推理与情境关联方面的局限性。此外,我们进一步反思了对LLM-可视化组合系统的评估现状,重点分析了当前研究如何应对这一挑战,并探讨了对此类系统进行有效评估方面存在的当前缺口。通过本次综述,我们旨在为LLM增强的可视化领域的未来研究与系统设计提供指引,以支持更广泛的用户群体及智能化的对话式界面。