This technical report presents K-EXAONE, a large-scale multilingual language model developed by LG AI Research. K-EXAONE is built on a Mixture-of-Experts architecture with 236B total parameters, activating 23B parameters during inference. It supports a 256K-token context window and covers six languages: Korean, English, Spanish, German, Japanese, and Vietnamese. We evaluate K-EXAONE on a comprehensive benchmark suite spanning reasoning, agentic, general, Korean, and multilingual abilities. Across these evaluations, K-EXAONE demonstrates performance comparable to open-weight models of similar size. K-EXAONE, designed to advance AI for a better life, is positioned as a powerful proprietary AI foundation model for a wide range of industrial and research applications.
翻译:本技术报告介绍了由LG AI Research开发的大规模多语言语言模型K-EXAONE。K-EXAONE基于混合专家架构构建,总参数量为2360亿,推理时激活参数量为230亿。它支持256K令牌的上下文窗口,并涵盖六种语言:韩语、英语、西班牙语、德语、日语和越南语。我们在涵盖推理、代理、通用、韩语及多语言能力的综合基准测试套件上评估了K-EXAONE。在这些评估中,K-EXAONE展现出与相似规模的开源权重模型相当的性能。K-EXAONE旨在通过人工智能推动更美好的生活,被定位为一款强大的专有AI基础模型,适用于广泛的工业和研究应用。