Predicting web service traffic has significant social value, as it can be applied to various practical scenarios, including but not limited to dynamic resource scaling, load balancing, system anomaly detection, service-level agreement compliance, and fraud detection. Web service traffic is characterized by frequent and drastic fluctuations over time and are influenced by heterogeneous web user behaviors, making accurate prediction a challenging task. Previous research has extensively explored statistical approaches, and neural networks to mine features from preceding service traffic time series for prediction. However, these methods have largely overlooked the causal relationships between services. Drawing inspiration from causality in ecological systems, we empirically recognize the causal relationships between web services. To leverage these relationships for improved web service traffic prediction, we propose an effective neural network module, CCMPlus, designed to extract causal relationship features across services. This module can be seamlessly integrated with existing time series models to consistently enhance the performance of web service traffic predictions. We theoretically justify that the causal correlation matrix generated by the CCMPlus module captures causal relationships among services. Empirical results on real-world datasets from Microsoft Azure, Alibaba Group, and Ant Group confirm that our method surpasses state-of-the-art approaches in Mean Squared Error (MSE) and Mean Absolute Error (MAE) for predicting service traffic time series. These findings highlight the efficacy of leveraging causal relationships for improved predictions.
翻译:网络服务流量预测具有重要的社会价值,可应用于多种实际场景,包括但不限于动态资源扩缩容、负载均衡、系统异常检测、服务等级协议合规性监测以及欺诈检测。网络服务流量随时间频繁剧烈波动,并受异构网络用户行为影响,使得准确预测成为一项具有挑战性的任务。先前研究广泛探索了统计方法与神经网络,旨在从历史服务流量时间序列中挖掘特征进行预测。然而,这些方法大多忽视了服务间的因果关系。受生态系统因果关系的启发,我们通过实证研究识别了网络服务间的因果关系。为利用这些关系改进网络服务流量预测,我们提出一种有效的神经网络模块CCMPlus,该模块旨在提取跨服务的因果关系特征。该模块可与现有时间序列模型无缝集成,持续提升网络服务流量预测的性能。我们从理论上论证了CCMPlus模块生成的因果相关矩阵能够捕捉服务间的因果关系。在微软Azure、阿里巴巴集团和蚂蚁集团的真实数据集上的实验结果表明,本方法在预测服务流量时间序列的均方误差(MSE)和平均绝对误差(MAE)指标上均优于现有先进方法。这些发现凸显了利用因果关系提升预测效果的有效性。