This paper aims to apply two major scaling transformations from the computing packaging industry to internet routers: the heterogeneous integration of high-bandwidth memories (HBMs) and chiplets, as well as in-package optics. We propose a novel internet router architecture that employs these technologies to achieve a petabit/sec router within a single integrated package. At the top-level, we introduce a novel split-parallel switch architecture that spatially divides (without processing) the incoming fibers and distributes them across smaller independent switches without intermediate OEO conversions or fine-tuned per-packet load-balancing. This passive spatial division enables scaling at the cost of a coarser traffic load balancing. Yet, through extensive evaluations of backbone network traffic, we demonstrate that differences with fine-tuned approaches are small. In addition, we propose a novel HBM-based shared-memory architecture for the implementation of the smaller independent switches, and we introduce a novel parallel frame interleaving algorithm that packs traffic into frames so that HBM banks are accessed at peak HBM data rates in a cyclical interleaving manner. We further discuss why these new technologies represent a paradigm shift in the design of future internet routers. Finally, we emphasize that power consumption may constitute the primary bottleneck to scaling.
翻译:本文旨在将计算封装行业的两大扩展技术应用于互联网路由器:高带宽内存(HBM)与芯粒的异构集成,以及封装内光学技术。我们提出了一种新型互联网路由器架构,通过采用这些技术,在单个集成封装内实现每秒拍比特级的路由器。在顶层,我们引入了一种新型的分割并行交换架构,该架构在空间上(无需处理)分割输入光纤,并将其分配到多个较小的独立交换机中,无需中间的光电光转换或精细的逐包负载均衡。这种被动的空间分割以更粗粒度的流量负载均衡为代价实现了扩展。然而,通过对骨干网络流量的广泛评估,我们证明其与精细调优方法之间的差异很小。此外,我们提出了一种基于HBM的共享内存架构,用于实现这些较小的独立交换机,并引入了一种新颖的并行帧交织算法,该算法将流量打包成帧,使得HBM存储体能够以峰值HBM数据速率按循环交织方式访问。我们进一步讨论了为何这些新技术代表了未来互联网路由器设计的范式转变。最后,我们强调功耗可能成为扩展的主要瓶颈。