WikiTableEdit: A Benchmark for Table Editing by Natural Language Instruction

Tabular data is a crucial form of data representation and appears in diverse formats on the Web. Manually modifying complex, irregular tables is laborious. This paper investigates how well Large Language Models (LLMs) perform on table editing tasks. Existing research focuses mainly on regular-shaped tables, where instructions are used to generate SQL, Python, or Excel Office Scripts code that manipulates the table. Editing tables with irregular structures through code, however, remains challenging, particularly when they contain merged cells spanning multiple rows. To address this, we introduce the WikiTableEdit dataset. Leveraging 26,531 tables from the WikiSQL dataset, we automatically generate natural language instructions for six distinct basic operations, together with the corresponding outcomes, resulting in over 200,000 instances. We then evaluate several representative large language models on WikiTableEdit to demonstrate the difficulty of the task. The dataset will be released to the community to promote related research.
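To make the merged-cell difficulty concrete, here is a minimal Python sketch. It is not taken from the paper: the HTML-style representation (each cell is a `(text, rowspan)` tuple, and a spanning cell is listed only in the first row it covers) and the helper names `expand`, `collapse`, and `delete_row` are assumptions made for illustration. Row deletion is used only as an example edit; the abstract does not enumerate the six operations.

```python
from typing import Dict, List, Tuple

Cell = Tuple[str, int]        # (text, rowspan); hypothetical representation
DenseCell = Tuple[int, str]   # (cell id, text) in a fully expanded grid


def expand(rows: List[List[Cell]]) -> List[List[DenseCell]]:
    """Expand HTML-style rows into a dense grid, repeating spanning cells."""
    grid: List[List[DenseCell]] = []
    pending: Dict[int, Tuple[int, int, str]] = {}  # col -> (rows left, id, text)
    next_id = 0
    for row in rows:
        dense: List[DenseCell] = []
        cells = iter(row)
        col = 0
        while True:
            if col in pending:
                # This column is still covered by a cell from an earlier row.
                left, cid, text = pending[col]
                dense.append((cid, text))
                if left == 1:
                    del pending[col]
                else:
                    pending[col] = (left - 1, cid, text)
                col += 1
                continue
            cell = next(cells, None)
            if cell is None:
                break
            text, rowspan = cell
            dense.append((next_id, text))
            if rowspan > 1:
                pending[col] = (rowspan - 1, next_id, text)
            next_id += 1
            col += 1
        grid.append(dense)
    return grid


def collapse(grid: List[List[DenseCell]]) -> List[List[Cell]]:
    """Merge vertically repeated cell ids back into (text, rowspan) cells."""
    rows: List[List[Cell]] = []
    for r, dense in enumerate(grid):
        row: List[Cell] = []
        for c, (cid, text) in enumerate(dense):
            if r > 0 and c < len(grid[r - 1]) and grid[r - 1][c][0] == cid:
                continue  # already emitted by the row above
            span = 1
            while (r + span < len(grid) and c < len(grid[r + span])
                   and grid[r + span][c][0] == cid):
                span += 1
            row.append((text, span))
        rows.append(row)
    return rows


def delete_row(rows: List[List[Cell]], index: int) -> List[List[Cell]]:
    """Delete one row while keeping merged cells consistent."""
    grid = expand(rows)
    del grid[index]
    return collapse(grid)


table = [
    [("Team", 1), ("Player", 1)],
    [("Reds", 2), ("Ann", 1)],   # "Reds" spans this row and the next
    [("Bob", 1)],
    [("Blues", 1), ("Cy", 1)],
]
edited = delete_row(table, 1)
# edited[1] is [("Reds", 1), ("Bob", 1)]: the merged cell moves down and shrinks.
```

Simply dropping `table[1]` from the list would lose "Reds" entirely, and dropping `table[2]` would leave "Reds" with a rowspan of 2 that now overlaps the "Blues" row. This bookkeeping is the kind of overhead that irregular tables impose on code-based editing.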


