We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks. We train our models mainly on the reasoning-specialized dataset that incorporates long streams of thought processes. Evaluation results show that our smaller models, EXAONE Deep 2.4B and 7.8B, outperform other models of comparable size, while the largest model, EXAONE Deep 32B, demonstrates competitive performance against leading open-weight models. All EXAONE Deep models are openly available for research purposes and can be downloaded from https://huggingface.co/LGAI-EXAONE.
翻译:我们推出EXAONE Deep系列模型,该系列在包括数学与编程基准测试在内的多种推理任务中展现出卓越能力。我们主要基于包含长链思维过程的推理专用数据集对模型进行训练。评估结果表明,我们较小的模型EXAONE Deep 2.4B与7.8B在同等规模模型中表现优异,而最大规模的EXAONE Deep 32B模型在与主流开源权重模型的对比中展现出竞争力。所有EXAONE Deep模型均面向研究用途开放,可通过https://huggingface.co/LGAI-EXAONE下载获取。