Risk-Aware and Stable Edge Server Selection Under Network Latency SLOs

We present a lightweight and interpretable decision framework for dynamic edge server selection in latency-critical applications that explicitly accounts for tail risk and switching stability. Each candidate server is characterised by predictive mean and uncertainty summaries of network latency, which are used to estimate the risk of service-level objective (SLO) violations and to guide selection. Risk is evaluated using a tight Normal approximation complemented by a conservative Cantelli bound, while percentile-based scoring coupled with hysteresis stabilizes decisions and suppresses oscillatory switching under short-lived network fluctuations. Experimental results on a multi-server edge testbed with a strict SLO of $τ= 0.5$\,s show that the proposed approach reduces the deadline-miss rate from 39\% to 34\% compared to a mean-only baseline, while reducing switching frequency from 46\% to 5.5\% ($\approx$88\% reduction) and maintaining sub-SLO average latency ($\approx$0.45\,s). These results demonstrate that explicit risk evaluation combined with stability-preserving control enables practical and robust adaptive server selection in dynamic edge environments.

翻译：我们提出了一种轻量级且可解释的决策框架，用于延迟关键型应用中的动态边缘服务器选择，该框架明确考虑了尾部风险与切换稳定性。每个候选服务器以网络延迟的预测均值和不确定性概述为特征，用于估计服务等级协议违反的风险并指导选择。风险评估采用严格的正态近似补充保守的Cantelli界，而基于百分位的评分与滞后机制相结合，可稳定决策并抑制短期网络波动下的振荡切换。在具有严格服务等级协议$τ=0.5$秒的多服务器边缘测试床上的实验结果表明，与仅基于均值的基线相比，所提方法将截止时间错过率从39%降低到34%，同时将切换频率从46%降低到5.5%（约降低88%），并保持低于服务等级协议的平均延迟（约0.45秒）。这些结果证明，显式风险评估与稳定性保持控制相结合，能够在动态边缘环境中实现实用且鲁棒的自适应服务器选择。

相关内容

服务器

关注 14

服务器，也称伺服器，是提供计算服务的设备。由于服务器需要响应服务请求，并进行处理，因此一般来说服务器应具备承担服务并且保障服务的能力。
服务器的构成包括处理器、硬盘、内存、系统总线等，和通用的计算机架构类似，但是由于需要提供高可靠的服务，因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。

《军事任务为中心网络安全风险评估中的不确定性》

专知会员服务

10+阅读 · 5月18日

基于脉冲神经网络的边缘智能

专知会员服务

21+阅读 · 2025年7月23日

《用于边缘云异常检测的机器学习》博士论文

专知会员服务

24+阅读 · 2025年1月20日

《边缘云异常检测的机器学习》最新博士论文

专知会员服务

27+阅读 · 2024年8月8日