We conducted a reproducibility-oriented re-evaluation of prior migraine classification studies, correcting for data leakage and metric bias. We then introduced (i) a clinically motivated aggregation of two hemiplegic subtypes following ICHD-3 §1.2.3, (ii) a class-dependent hybrid augmentation strategy that assigns generation methods based on per-class sample size, and (iii) the concept of fidelity asymmetry, motivating proportionally constrained growth as an alternative to full class balance. Experiments were performed on a dataset of 400 patients across seven migraine subtypes under a two-stage protocol, including the six-class configuration described above. Models were evaluated using stratified 5-fold cross-validation with macro-averaged F1 as the primary metric. Correcting methodological flaws reduces previously inflated performance estimates, with the corrected macro-F1 baseline standing at 0.71. The proposed framework consistently outperformed individual augmenters in macro-F1 averaged across the eight evaluated classifiers (0.862 vs. 0.836 for Gaussian Copula, 0.815 for CTGAN, and 0.801 for the no-augmentation baseline), and achieved its peak result of 0.914 with FT-Transformer under proportional augmentation. The no-augmentation FT-Transformer baseline (0.896) shows that, at the per-classifier ceiling, clinically motivated class aggregation accounts for most of the absolute improvement; the framework's principal measurable contribution is the gain in average robustness across classifiers, highlighting the dominant role of problem formulation.
翻译:我们以可重复性为导向重新评估了既往偏头痛分类研究,针对数据泄露与指标偏差进行了校正。随后提出:(i)依据ICHD-3 §1.2.3标准对两种偏瘫亚型进行临床驱动的聚合;(ii)基于类别样本量分配生成方法的类别依赖混合增强策略;(iii)保真度不对称性概念,提出按比例约束增长替代完全类别平衡。实验基于包含400名患者、涵盖七种偏头痛亚型的数据集,采用两阶段方案(含上述六类别配置)进行。模型通过分层五折交叉验证评估,以宏平均F1为主要指标。纠正方法论缺陷后,原始高估的性能估算值得以修正,校正后的宏F1基线为0.71。在八种评估分类器中,所提框架的宏平均F1始终优于单一增强方法(0.862 vs 高斯Copula的0.836、CTGAN的0.815及无增强基线的0.801),并在比例增强条件下以FT-Transformer达到峰值0.914。无增强FT-Transformer基线(0.896)表明,在分类器性能天花板附近,临床驱动的类别聚合贡献了大部分绝对改进;该框架的主要可量化贡献在于提升跨分类器的平均鲁棒性,凸显了问题定义的主导作用。