Muchisim: A Simulation Framework for Design Exploration of Multi-Chip Manycore Systems

Current design-space exploration tools cannot accurately evaluate communication-intensive applications whose execution is data-dependent (e.g., graph analytics and sparse linear algebra) on scale-out manycore systems, due to either lack of scalability or lack of detail in modeling the network. This paper presents Muchisim, a novel parallel simulator designed to address the challenges in exploring the design space of distributed multi-chiplet manycore architectures for communication-intensive applications. We evaluate Muchisim at simulating systems with up to a million interconnected processing elements (PEs) while modeling data movement and communication in a cycle-accurate manner. In addition to performance, Muchisim reports the energy, area, and cost of the simulated system, and it comes with a benchmark application suite and two data visualization tools. Muchisim supports various parallelization strategies and communication primitives such as task-based parallelization and message passing, making it highly relevant for architectures with software-managed coherence and distributed memory. Via a case study, we show that Muchisim helps users explore the balance between memory and computation units and the constraints related to chiplet integration and inter-chip communication. Muchisim enables scaling up the systems in which new techniques or design parameters are evaluated, opening the gate for further research in this area.

翻译：当前的探索设计空间工具无法准确评估执行依赖于数据的通信密集型应用（例如图分析和稀疏线性代数）在可扩展众核系统上的性能，原因在于缺乏可扩展性或对网络建模的细节不足。本文提出了Muchisim，一种新型并行仿真器，旨在解决针对通信密集型应用的多芯片分布式众核架构的设计空间探索挑战。我们评估了Muchisim在模拟包含多达一百万个互连处理单元（PE）的系统时的性能，同时以周期精确的方式建模数据移动和通信。除性能外，Muchisim还报告仿真系统的能耗、面积和成本，并附带基准测试应用套件和两个数据可视化工具。Muchisim支持多种并行化策略和通信原语，例如基于任务的并行化和消息传递，使其在具有软件管理一致性和分布式内存的架构中高度相关。通过案例研究，我们展示了Muchisim帮助用户探索内存与计算单元之间的平衡，以及与芯片集成和芯片间通信相关的约束。Muchisim能够扩展系统规模以评估新技术或设计参数，为这一领域的进一步研究打开了大门。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日