MareNostrum5 is a pre-exascale supercomputer at the Barcelona Supercomputing Center (BSC), part of the EuroHPC Joint Undertaking. With a peak performance of 314 petaflops, MareNostrum5 features a hybrid architecture comprising Intel Sapphire Rapids CPUs, NVIDIA Hopper GPUs, and DDR5 and high-bandwidth memory (HBM), organized into four partitions optimized for diverse workloads. This document evaluates MareNostrum5 through micro-benchmarks (floating-point performance, memory bandwidth, interconnect throughput), HPC benchmarks (HPL and HPCG), and application studies using Alya, OpenFOAM, and IFS. It highlights MareNostrum5's scalability, efficiency, and energy performance, utilizing the EAR (Energy Aware Runtime) framework to assess power consumption and the effects of direct liquid cooling. Additionally, HBM and DDR5 configurations are compared to examine memory performance trade-offs. Designed to complement standard technical documentation, this study provides insights to guide both new and experienced users in optimizing their workloads and maximizing MareNostrum5's computational capabilities.
翻译:MareNostrum5 是巴塞罗那超级计算中心(BSC)部署的一台近E级超级计算机,隶属于欧洲高性能计算联合计划(EuroHPC)。该系统峰值性能达 314 Petaflops,采用混合架构,包含英特尔 Sapphire Rapids CPU、英伟达 Hopper GPU 以及 DDR5 和高带宽内存(HBM),并组织为四个针对不同工作负载优化的分区。本文通过微基准测试(浮点性能、内存带宽、互连吞吐量)、高性能计算基准测试(HPL 和 HPCG)以及使用 Alya、OpenFOAM 和 IFS 的应用研究,对 MareNostrum5 进行了评估。研究重点分析了 MareNostrum5 的可扩展性、效率及能效表现,利用能量感知运行时(EAR)框架评估了功耗及直接液冷的效果。此外,通过比较 HBM 与 DDR5 配置,探讨了内存性能的权衡。本研究旨在补充标准技术文档,为经验丰富或新入门的用户提供指导,以优化其工作负载并充分利用 MareNostrum5 的计算能力。