Tables are prevalent in real-world databases, requiring significant time and effort for humans to analyze and manipulate. The advancements in large language models (LLMs) have made it possible to interact with tables using natural language input, bringing this capability closer to reality. In this paper, we present TableGPT, a unified fine-tuned framework that enables LLMs to understand and operate on tables using external functional commands. It introduces the capability to seamlessly interact with tables, enabling a wide range of functionalities such as question answering, data manipulation (e.g., insert, delete, query, and modify operations), data visualization, analysis report generation, and automated prediction. TableGPT aims to provide convenience and accessibility to users by empowering them to effortlessly leverage tabular data. At the core of TableGPT lies the novel concept of global tabular representations, which empowers LLMs to gain a comprehensive understanding of the entire table beyond meta-information. By jointly training LLMs on both table and text modalities, TableGPT achieves a deep understanding of tabular data and the ability to perform complex operations on tables through chain-of-command instructions. Importantly, TableGPT offers the advantage of being a self-contained system rather than relying on external API interfaces. Moreover, it supports efficient data process flow, query rejection (when appropriate) and private deployment, enabling faster domain data fine-tuning and ensuring data privacy, which enhances the framework's adaptability to specific use cases.
翻译:表格在现实世界数据库中普遍存在,人类需要耗费大量时间和精力进行分析与操作。大语言模型(LLMs)的进步使得通过自然语言输入与表格交互成为可能,这一能力正逐步接近现实。本文提出TableGPT——一个统一的微调框架,使LLMs能够利用外部功能命令理解与操作表格。该框架引入了与表格无缝交互的能力,支持广泛功能,包括问答、数据操作(如插入、删除、查询与修改)、数据可视化、分析报告生成以及自动化预测。TableGPT旨在通过使用户轻松利用表格数据,提供便利性与可及性。其核心在于全局表格表示这一创新概念,使LLMs能够超越元信息全面理解整个表格。通过在表格与文本两种模态上联合训练LLMs,TableGPT实现了对表格数据的深刻理解,并能通过指令链对表格执行复杂操作。重要的是,TableGPT具有作为自包含系统的优势,无需依赖外部API接口。此外,它还支持高效数据处理流程、合理查询拒绝以及私密部署,从而实现更快速的领域数据微调并确保数据隐私,增强了框架对特定用例的适应性。