This paper presents the development process of a natural language to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73\% and 84\% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.
翻译:本文介绍了基于T5模型开发自然语言转SQL模型的过程。这些模型于2022年8月为在线交易处理系统和数据仓库开发,其精确匹配准确率分别达到73%和84%。这些模型与研究项目中完成的其他工作相结合,已在多家公司部署并成功实现日常应用。模型开发所采用的方法可类似地应用于其他数据库环境,并可配合更强大的预训练语言模型使用。