A Grouped Sorting Queue Supporting Dynamic Updates for Timer Management in High-Speed Network Interface Cards

With the hardware offloading of network functions, network interface cards (NICs) undertake massive stateful, high-precision, and high-throughput tasks, where timers serve as a critical enabling component. However, existing timer management schemes suffer from heavy software load, low precision, lack of hardware update support, and overflow. This paper proposes two novel operations for priority queues--update and group sorting--to enable hardware timer management. To the best of our knowledge, this work presents the first hardware priority queue to support an update operation through the composition and propagation of basic operations to modify the priorities of elements within the queue. The group sorting mechanism ensures correct timing behavior post-overflow by establishing a group boundary priority to alter the sorting process and element insertion positions. Implemented with a hybrid architecture of a one-dimension (1D) systolic array and shift registers, our design is validated through packet-level simulations for flow table timeout management. Results demonstrate that a 4K-depth, 16-bit timer queue achieves over 500 MHz (175 Mpps, 12 ns precision) in a 28nm process and over 300 MHz (116 Mpps) on an FPGA. Critically, it reduces LUTs and FFs usage by 31% and 25%, respectively, compared to existing designs.

翻译：随着网络功能的硬件卸载，网络接口卡（NIC）承担了大量有状态、高精度、高吞吐量的任务，其中定时器作为关键的使能组件。然而，现有的定时器管理方案存在软件负载重、精度低、缺乏硬件更新支持以及溢出等问题。本文提出了优先级队列的两种新颖操作——更新和分组排序——以实现硬件定时器管理。据我们所知，这项研究首次提出了一种通过基本操作的组合与传播来修改队列内元素优先级，从而支持更新操作的硬件优先级队列。分组排序机制通过建立组边界优先级来改变排序过程和元素插入位置，确保了溢出后定时行为的正确性。我们的设计采用一维脉动阵列和移位寄存器的混合架构实现，并通过数据包级仿真在流表超时管理场景中进行了验证。结果表明，一个深度为4K、位宽为16位的定时器队列在28nm工艺下可实现超过500 MHz（175 Mpps，12 ns精度）的工作频率，在FPGA上可实现超过300 MHz（116 Mpps）的频率。关键的是，与现有设计相比，它分别将LUT和FF的使用量减少了31%和25%。

相关内容

排序

关注 313

排序是计算机内经常进行的一种操作，其目的是将一组“无序”的记录序列调整为“有序”的记录序列。分内部排序和外部排序。若整个排序过程不需要访问外存便能完成，则称此类排序问题为内部排序。反之，若参加排序的记录数量很大，整个序列的排序过程不可能在内存中完成，则称此类排序问题为外部排序。内部排序的过程是一个逐步扩大记录的有序序列长度的过程。

【KDD2024】CAFO：基于特征的时间序列分类解释

专知会员服务

25+阅读 · 2024年6月5日

基于深度学习的时间序列分类研究综述

专知会员服务

83+阅读 · 2024年1月8日

时间序列如何用自监督？浙大最新《自监督学习时间序列分析：分类、进展与展望》

专知会员服务

72+阅读 · 2023年6月24日

【AAAI2023】统一序列更好:时间间隔感知数据增强的序列推荐

专知会员服务

16+阅读 · 2022年12月31日