DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry

Active Queue Management (AQM) is a mechanism employed to alleviate transient congestion in network device buffers, such as routers and switches. Traditional AQM algorithms use fixed thresholds, like target delay or queue occupancy, to compute random packet drop probabilities. A very small target delay can increase packet losses and reduce link utilization, while a large target delay may increase queueing delays while lowering drop probability. Due to dynamic network traffic characteristics, where traffic fluctuations can lead to significant queue variations, maintaining a fixed threshold AQM may not suit all applications. Consequently, we explore the question: \textit{What is the ideal threshold (target delay) for AQMs?} In this work, we introduce DESiRED (Dynamic, Enhanced, and Smart iRED), a P4-based AQM that leverages precise network feedback from In-band Network Telemetry (INT) to feed a Deep Reinforcement Learning (DRL) model. This model dynamically adjusts the target delay based on rewards that maximize application Quality of Service (QoS). We evaluate DESiRED in a realistic P4-based test environment running an MPEG-DASH service. Our findings demonstrate up to a 90x reduction in video stall and a 42x increase in high-resolution video playback quality when the target delay is adjusted dynamically by DESiRED.

翻译：主动队列管理（AQM）是一种用于缓解网络设备（如路由器和交换机）缓冲区瞬时拥塞的机制。传统AQM算法采用固定阈值（如目标延迟或队列占用率）来计算随机丢包概率。过小的目标延迟可能增加丢包率并降低链路利用率，而过大的目标延迟虽能降低丢包概率但可能增大排队延迟。由于网络流量具有动态特性，流量波动会导致队列显著变化，因此固定阈值的AQM可能无法适应所有应用场景。基于此，我们探讨了以下问题：*AQM的理想阈值（目标延迟）应如何确定？* 本文提出DESiRED（动态、增强与智能iRED），一种基于P4的AQM方案。该方法利用带内网络遥测（INT）提供的精准网络反馈，驱动深度强化学习（DRL）模型，根据最大化应用服务质量（QoS）的奖励机制动态调整目标延迟。我们在运行MPEG-DASH服务的真实P4测试环境中评估了DESiRED的性能。实验结果表明，当DESiRED动态调整目标延迟时，视频卡顿可减少90倍，高分辨率视频播放质量提升42倍。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Query2box: 使用盒嵌入对向量空间中的知识图谱进行推理，Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings

专知会员服务

46+阅读 · 2020年5月11日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日