Logs are a fundamental component of modern computer systems. They enable the analysis and monitoring teams to understand any abnormal or malicious behavior that may have occurred. The continuous increase in the volume of logs generated by these systems made it unsuitable for manual inspection and represents a real challenge with regard to process automation. In order to process these data, several log-structuring solutions have been developed. In this article, we analyze the capabilities of two solutions in order to meet the challenges of Cloud Computing in terms of efficiency and effectiveness. Our work focuses on the impact of parameterization and preprocessing on the performance of these methods -- two important steps as they require human intervention, which is incompatible with with the automation of the log-structuring process.
翻译:日志是现代计算机系统的基本组成部分。它们使分析和监控团队能够了解可能发生的任何异常或恶意行为。这些系统生成的日志量持续增加,使得人工检查变得不切实际,并对流程自动化构成了真正的挑战。为了处理这些数据,已开发出多种日志结构化解决方案。本文分析了两种解决方案的能力,以应对云计算在效率和效果方面的挑战。我们的研究重点关注参数化和预处理对这些方法性能的影响——这两个重要步骤需要人工干预,这与日志结构化过程的自动化不相容。