Process mining is a well-established discipline of data analysis focused on the discovery of process models from the event logs of information systems. Recently, stochastic process discovery has emerged as a subarea of process mining. It takes the frequencies of events in the event data into account and thus allows for more comprehensive analysis. In particular, when the event log records the durations of activities, performance characteristics of the discovered stochastic models can be analyzed; for example, the overall process execution time can be estimated. Existing performance analysis techniques usually discover stochastic process models from event data and then simulate these models to evaluate their execution times, so they are empirical in nature. This paper proposes analytical techniques for performance analysis that derive statistical characteristics of the overall process execution time in the presence of arbitrary time distributions of events modeled by semi-Markov processes. The proposed methods can significantly simplify what-if analysis of processes by providing solutions without resorting to simulation.
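To illustrate what an analytical alternative to simulation can look like, the following is a minimal sketch, not the method proposed in the paper. It applies textbook first-step analysis to a small absorbing semi-Markov model and computes the mean and variance of the overall execution time by solving two linear systems. The two-activity process, its numbers, and the names `solve` and `execution_time_moments` are hypothetical. The sketch also assumes that the sojourn time in a state is independent of the next state chosen, and that the process terminates with probability 1.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: move the largest entry of this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def execution_time_moments(p, mean, second):
    """Mean and variance of the time to completion from each transient state.

    p[i][j]   -- probability of moving from transient state i to transient
                 state j (the remaining row mass is the probability of finishing)
    mean[i]   -- E[S_i], mean sojourn time in state i
    second[i] -- E[S_i^2], second moment of the sojourn time in state i
    Assumption: S_i is independent of the next state.
    """
    n = len(mean)
    i_minus_p = [[(1.0 if i == j else 0.0) - p[i][j] for j in range(n)]
                 for i in range(n)]
    # First moments: t = mean + P t  <=>  (I - P) t = mean
    t = solve(i_minus_p, mean)
    # Second moments: u = second + 2 * mean * (P t) + P u
    pt = [sum(p[i][j] * t[j] for j in range(n)) for i in range(n)]
    u = solve(i_minus_p, [second[i] + 2.0 * mean[i] * pt[i] for i in range(n)])
    var = [u[i] - t[i] ** 2 for i in range(n)]
    return t, var


# Hypothetical process: activity A is always followed by activity B;
# after B the case is reworked (back to A) with probability 0.2, else it ends.
P = [[0.0, 1.0],
     [0.2, 0.0]]
MEAN = [2.0, 3.0]     # A takes exactly 2 time units; B is exponential with mean 3
SECOND = [4.0, 18.0]  # E[S^2]: 2^2 for the constant, 2 * 3^2 for the exponential

t, var = execution_time_moments(P, MEAN, SECOND)
print("mean execution time from A:", round(t[0], 4))
print("variance of execution time from A:", round(var[0], 4))
```

For this toy model the mean execution time from A is 6.25 and the variance is about 19.06. The mean depends on the sojourn-time distributions only through their means, and the variance only through their first two moments, which is why arbitrary distributions pose no difficulty here. A what-if question, such as lowering the rework probability, amounts to changing one entry of `P` and solving again.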