Container technology, as the key enabler behind microservice architectures, is widely applied in Cloud and Edge Computing. A long and continuous running of operating system (OS) host-ing container-based services can encounter software aging that leads to performance deterioration and even causes system fail-ures. OS rejuvenation techniques can mitigate the impact of software aging but the rejuvenation trigger interval needs to be carefully determined to reduce the downtime cost due to rejuve-nation. This paper proposes a comprehensive semi-Markov-based approach to quantitatively evaluate the effect of OS reju-venation on the dependability and the performance of a con-tainer-based service. In contrast to the existing studies, we nei-ther restrict the distributions of time intervals of events to be exponential nor assume that backup resources are always avail-able. Through the numerical study, we show the optimal con-tainer-migration trigger intervals that can maximize the de-pendability or minimize the performance of a container-based service.
翻译:容器技术作为微服务架构的关键支撑,广泛应用于云计算与边缘计算领域。长时间连续运行承载容器化服务的操作系统主机会遭遇软件老化问题,导致性能衰退甚至引发系统故障。操作系统修复技术可缓解软件老化影响,但需精确确定修复触发间隔以降低修复导致的服务中断成本。本文提出一种基于半马尔可夫的综合方法,定量评估操作系统修复对容器服务可靠性与性能的影响。与现有研究不同,我们既不限制事件时间间隔的分布形式为指数分布,也不假设备份资源始终可用。通过数值分析,我们展示了能够最大化容器服务可靠性或最小化其性能的最优容器迁移触发间隔。