Large language models (LLMs) have shown great potential to solve varieties of natural language processing (NLP) tasks, including mathematical reasoning. In this work, we present SkyMath, a large language model for mathematics with 13 billion parameters. By applying self-compare fine-tuning, we have enhanced mathematical reasoning abilities of Skywork-13B-Base remarkably. On GSM8K, SkyMath outperforms all known open-source models of similar size and has established a new SOTA performance.
翻译:大型语言模型(LLMs)在解决自然语言处理(NLP)任务(包括数学推理)方面展现出巨大潜力。本文提出SkyMath,一个拥有130亿参数的数学专用大语言模型。通过应用自比对微调技术,我们显著提升了Skywork-13B-Base的数学推理能力。在GSM8K基准测试中,SkyMath超越了所有已知同等规模的开源模型,并创下了新的SOTA性能记录。