Publicly available full-field digital mammography (FFDM) datasets remain limited in size, clinical annotations, and vendor diversity, hindering the development of robust models. We introduce LUMINA, a curated, multi-vendor FFDM dataset that explicitly encodes acquisition energy and vendor metadata to capture clinically relevant appearance variations often overlooked in existing benchmarks. This dataset contains 1824 images from 468 patients (960 benign, 864 malignant), with pathology-confirmed labels, BI-RADS assessments, and breast-density annotations. LUMINA spans six acquisition systems and includes both high- and low-energy imaging styles, enabling systematic analysis of vendor- and energy-induced domain shifts. To address these variations, we propose a foreground-only pixel-space alignment method (''energy harmonization'') that maps images to a low-energy reference while preserving lesion morphology. We benchmark CNN and transformer models on three clinically relevant tasks: diagnosis (benign vs. malignant), BI-RADS classification, and density estimation. Two-view models consistently outperform single-view models. EfficientNet-B0 achieves an AUC of 93.54% for diagnosis, while Swin-T achieves the best macro-AUC of 89.43% for density prediction. Harmonization improves performance across architectures and produces more localized Grad-CAM responses. Overall, LUMINA provides (1) a vendor-diverse benchmark and (2) a model-agnostic harmonization framework for reliable and deployable mammography AI.
翻译:现有的公开全视野数字乳腺X线影像(FFDM)数据集在规模、临床标注和厂商多样性方面仍十分有限,这阻碍了稳健模型的开发。我们提出LUMINA——一个经精心整理的多厂商FFDM数据集,该数据集显式编码了采集能量和厂商元数据,以捕捉现有基准测试常忽略的临床相关外观变异。该数据集包含来自468名患者的1824幅影像(960例良性、864例恶性),并配有病理确诊标签、BI-RADS评估和乳腺密度标注。LUMINA覆盖六种采集系统,同时包含高能量和低能量成像模式,从而支持对厂商和能量诱发域偏移的系统分析。为应对这些变异,我们提出一种仅前景像素空间对齐方法(“能量归一化”),在保留病灶形态的同时将影像映射至低能量参考图像。我们基于卷积神经网络和Transformer模型,在三个临床相关任务上进行基准测试:诊断(良性vs.恶性)、BI-RADS分类和密度估计。双视角模型表现始终优于单视角模型。EfficientNet-B0在诊断任务中达到93.54%的AUC,而Swin-T在密度预测任务中取得89.43%的最佳宏平均AUC。能量归一化方法可提升各架构的性能,并生成更聚焦的Grad-CAM响应。总体而言,LUMINA提供了(1)一个厂商多样的基准测试集,以及(2)一个模型无关的归一化框架,用于构建可靠且可部署的乳腺X线影像人工智能系统。