The march toward developing relevant and robust CPU benchmarks continues with the introduction of SPEC CPU 2026, the next generation suite for measuring processor performance. This paper details the methodology behind its creation, showcasing a process centered on community collaboration and principled development. The suite is built upon a foundation of modern, open-source applications, selected and hardened through a process that emphasizes workload diversity, portability, and software longevity. A key contribution is Rolling-Round-Robin Rate, a novel and standardized approach to running heterogeneous, multiprogrammed workloads that addresses a long-standing gap in benchmarking practice. Additionally, the suite features an expanded set of multithreaded benchmarks and introduces workloads with distinct microarchitectural profiles, reflecting the demands of contemporary software. By detailing our principled approach to benchmark selection, adaptation, and validation, we demonstrate how the SPEC CPU 2026 suite sets the standard for performance evaluation in the next era of computer architecture research and development.
翻译:随着SPEC CPU 2026——新一代处理器性能测量套件的发布,开发相关且稳健的CPU基准测试工作持续推进。本文详细阐述了其创建方法,展示了一个以社区协作和规范化开发为核心的流程。该套件基于现代开源应用程序构建,通过强调工作负载多样性、可移植性及软件长期维护性的流程进行筛选与加固。一项关键贡献是滚动轮询速率(Rolling-Round-Robin Rate),这是一种用于异构多程序工作负载的新型标准化方法,弥补了基准测试实践中长期存在的空白。此外,该套件扩展了多线程基准测试集,并引入了具有独特微架构特征的工作负载,反映了当代软件的需求。通过详述我们在基准测试选择、适配与验证中的规范化方法,我们展示了SPEC CPU 2026套件如何为下一代计算机架构研究与开发中的性能评估树立标杆。