Accel-Sim is a widely used computer architecture simulator that models the behavior of modern NVIDIA GPUs in great detail. However, although Accel-Sim and the underlying GPGPU-Sim model many of the features of real GPUs, thus far it has not been able to track statistics separately per stream. Instead, Accel-Sim combines statistics (e.g., cycles and cache hits/misses) across all simultaneously running streams. This can prevent users from properly identifying the behavior of specific kernels and streams and potentially lead to incorrect conclusions. Thus, in this work we extend Accel-Sim's and GPGPU-Sim's statistic tracking support to track per-stream statistics. To validate this support, we designed a series of multi-stream microbenchmarks and checked their reported per-kernel, per-stream counts.
翻译:Accel-Sim是一款广泛使用的计算机体系结构模拟器,能够详细模拟现代NVIDIA GPU的行为。然而,尽管Accel-Sim及其底层GPGPU-Sim模型模拟了真实GPU的诸多特性,但其至今未能实现对每流统计的独立追踪。相反,Accel-Sim会将所有并发运行流的统计数据(如周期数、缓存命中/缺失)合并处理。这可能导致用户无法准确识别特定内核与流的行为,甚至得出错误结论。为此,本研究扩展了Accel-Sim与GPGPU-Sim的统计追踪功能,使其支持每流统计。为验证该功能,我们设计了一系列多流微基准测试,并检验了其报告的每内核、每流计数结果。