The need for high-quality automated seizure detection algorithms based on electroencephalography (EEG) becomes ever more pressing with the increasing use of ambulatory and long-term EEG monitoring. Heterogeneity in validation methods of these algorithms influences the reported results and makes comprehensive evaluation and comparison challenging. This heterogeneity concerns in particular the choice of datasets, evaluation methodologies, and performance metrics. In this paper, we propose a unified framework designed to establish standardization in the validation of EEG-based seizure detection algorithms. Based on existing guidelines and recommendations, the framework introduces a set of recommendations and standards related to datasets, file formats, EEG data input content, seizure annotation input and output, cross-validation strategies, and performance metrics. We also propose the 10-20 seizure detection benchmark, a machine-learning benchmark based on public datasets converted to a standardized format. This benchmark defines the machine-learning task as well as reporting metrics. We illustrate the use of the benchmark by evaluating a set of existing seizure detection algorithms. The SzCORE (Seizure Community Open-source Research Evaluation) framework and benchmark are made publicly available along with an open-source software library to facilitate research use, while enabling rigorous evaluation of the clinical significance of the algorithms, fostering a collective effort to more optimally detect seizures to improve the lives of people with epilepsy.
翻译:随着动态和长期脑电图(EEG)监测的广泛应用,对基于EEG的高质量自动癫痫发作检测算法的需求日益迫切。这些算法验证方法的异质性会影响报告结果,使得全面评估和比较变得困难。这种异质性尤其涉及数据集、评估方法和性能指标的选择。在本文中,我们提出一个统一框架,旨在建立基于EEG的癫痫发作检测算法验证的标准化。基于现有指南和建议,该框架引入了一套关于数据集、文件格式、EEG数据输入内容、癫痫发作标注输入与输出、交叉验证策略和性能指标的建议与标准。我们还提出了10-20癫痫发作检测基准,这是一个基于转换为标准化格式的公开数据集的机器学习基准。该基准定义了机器学习任务以及报告指标。通过评估一组现有癫痫发作检测算法,我们展示了该基准的使用方法。SzCORE(癫痫社区开源研究评估)框架和基准与开源软件库一同公开提供,以促进研究应用,同时支持对算法临床意义的严格评估,从而推动集体努力更优地检测癫痫发作,改善癫痫患者的生活质量。