The explosion of data in recent years is driving individuals to leverage technology to generate insights. Traditional tools bring heavy learning overheads and the requirement for understanding complex charting techniques. Such barriers can hinder those who may benefit from harnessing data for informed decision making. The emerging field of generating data visualisations from natural language text (NL2VIS) addresses this issue. This study showcases Chat2VIS, a state-of-the-art NL2VIS solution. It capitalises on the latest in AI technology with the upsurge in pre-trained large language models (LLMs) such as GPT-3, Codex, and ChatGPT. Furthermore, the rise in natural language interfaces (NLI) and chatbots is taking centre stage. This work illustrates how Chat2VIS leverages similar techniques to fine-tune data visualisation components beyond that demonstrated in previous approaches. In addition, this paper presents the flexibility of Chat2VIS to comprehend multilingual natural language requests. No other NL2VIS system has demonstrated this unique talent. In concluding, this research provides quantitative benchmarking evaluations to contribute to the paucity of NL2VIS standards.
翻译:近年来数据的爆炸式增长促使人们利用技术手段获取洞见。传统工具存在学习成本高、需掌握复杂图表技术等问题,这些障碍可能阻碍那些本可借助数据驱动决策的受益者。从自然语言文本生成数据可视化的新兴领域(NL2VIS)正是针对这一难题。本研究展示了Chat2VIS这一前沿NL2VIS解决方案,它依托最新人工智能技术,借助GPT-3、Codex、ChatGPT等预训练大语言模型(LLM)的爆发式发展。同时,自然语言界面(NLI)与聊天机器人的兴起正成为核心关注点。本研究阐明了Chat2VIS如何运用类似技术,在前人方法基础上进一步优化数据可视化组件的微调能力。此外,本文还呈现了Chat2VIS理解多语言自然语言请求的灵活性——目前尚无其他NL2VIS系统展现过这一独特能力。最后,本研究通过定量基准评估,为弥补NL2VIS标准体系的缺失提供了数据支撑。