Improving Generalization in Semantic Parsing by Increasing Natural Language Variation

Text-to-SQL semantic parsing has made significant progress in recent years, with various models demonstrating impressive performance on the challenging Spider benchmark. However, it has also been shown that these models often struggle to generalize even when faced with small perturbations of previously (accurately) parsed expressions. This is mainly due to the linguistic form of questions in Spider which are overly specific, unnatural, and display limited variation. In this work, we use data augmentation to enhance the robustness of text-to-SQL parsers against natural language variations. Existing approaches generate question reformulations either via models trained on Spider or only introduce local changes. In contrast, we leverage the capabilities of large language models to generate more realistic and diverse questions. Using only a few prompts, we achieve a two-fold increase in the number of questions in Spider. Training on this augmented dataset yields substantial improvements on a range of evaluation sets, including robustness benchmarks and out-of-domain data.

翻译：文本到SQL语义解析近年来取得了显著进展，各类模型在具有挑战性的Spider基准测试中展现出卓越性能。然而研究表明，即使面对先前已准确解析表达的微小扰动，这些模型仍难以实现有效泛化。这主要源于Spider中问题的语言形式过于特定、不自然且缺乏变体多样性。本研究通过数据增强技术提升文本到SQL解析器对自然语言变体的鲁棒性。现有方法要么借助Spider训练的模型生成问题改写，要么仅引入局部修改。与之相反，我们利用大语言模型的能力生成更真实且多样化的问题。仅通过少量提示词，我们就实现了Spider问题数量的两倍增长。基于该增强数据集的训练在包括鲁棒性基准测试和域外数据在内的多项评估集上均取得了显著改进。

相关内容

网络爬虫

关注 13

网络爬虫（又被称为网页蜘蛛，网络机器人，在FOAF社区中间，更经常被称为网页追逐者），是一种按照一定的规则，自动的抓取万维网信息的程序或者脚本，已被广泛应用于互联网领域。搜索引擎使用网络爬虫抓取Web网页、文档甚至图片、音频、视频等资源，通过相应的索引技术组织这些信息，提供给搜索用户进行查询。网络爬虫也为中小站点的推广提供了有效的途径。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日