This report overviews our ongoing work in enriching chain-of-thoughts datasets requiring arithmetical reasoning with the integration of non-parametric components, such as a calculator. We conduct an analysis of prominent relevant datasets such as GSM8K, Ape210K, AQuA-RAT, and MathQA and propose a machine-processable HTML-like format specifically tailored for working with semi-structured chains. By converting the datasets into this unified format, we enable the effective integration of large language models and symbolic systems, empowering them to tackle arithmetical reasoning tasks more efficiently.
翻译:论文摘要:本报告概述了我们当前在算术推理思维链数据集中集成非参数化组件(如计算器)以丰富数据集的持续工作。我们对GSM8K、Ape210K、AQuA-RAT和MathQA等主要相关数据集进行了分析,并提出了一种专为处理半结构化链而设计的机器可处理的类似HTML的格式。通过将这些数据集转换为这种统一格式,我们能够有效整合大型语言模型与符号系统,从而更高效地应对算术推理任务。