The global burden of acute and chronic wounds presents a compelling case for enhancing wound classification methods, a vital step in diagnosing and determining optimal treatments. Recognizing this need, we introduce an innovative multi-modal network based on a deep convolutional neural network for categorizing wounds into four categories: diabetic, pressure, surgical, and venous ulcers. Our multi-modal network uses wound images and their corresponding body locations for more precise classification. A unique aspect of our methodology is incorporating a body map system that facilitates accurate wound location tagging, improving upon traditional wound image classification techniques. A distinctive feature of our approach is the integration of models such as VGG16, ResNet152, and EfficientNet within a novel architecture. This architecture includes elements like spatial and channel-wise Squeeze-and-Excitation modules, Axial Attention, and an Adaptive Gated Multi-Layer Perceptron, providing a robust foundation for classification. Our multi-modal network was trained and evaluated on two distinct datasets comprising relevant images and corresponding location information. Notably, our proposed network outperformed traditional methods, reaching an accuracy range of 74.79% to 100% for Region of Interest (ROI) without location classifications, 73.98% to 100% for ROI with location classifications, and 78.10% to 100% for whole image classifications. This marks a significant enhancement over previously reported performance metrics in the literature. Our results indicate the potential of our multi-modal network as an effective decision-support tool for wound image classification, paving the way for its application in various clinical contexts.
翻译:急性和慢性伤口的全球负担凸显了改进伤口分类方法的必要性,这是诊断和确定最佳治疗方案的关键步骤。针对这一需求,我们提出了一种基于深度卷积神经网络的新型多模态网络,用于将伤口分为四类:糖尿病性溃疡、压力性溃疡、手术切口和静脉性溃疡。我们的多模态网络利用伤口图像及其对应的身体位置信息,实现更精确的分类。该方法的一个独特之处在于引入了身体地图系统,可促进准确的伤口位置标记,从而改进传统的伤口图像分类技术。我们方法的另一显著特征是将VGG16、ResNet152和EfficientNet等模型集成到新颖架构中,该架构包含空间和通道维度的Squeeze-and-Excitation模块、轴向注意力机制以及自适应门控多层感知器,为分类提供了稳健基础。我们的多模态网络在包含相关图像和对应位置信息的两个不同数据集上进行了训练和评估。值得注意的是,所提出的网络优于传统方法,在无位置分类的感兴趣区域(ROI)上达到74.79%至100%的准确率,在有位置分类的ROI上达到73.98%至100%,在全图分类上达到78.10%至100%。这标志着相较于文献中先前报告的性能指标有了显著提升。结果表明,我们的多模态网络有潜力成为有效的伤口图像分类决策支持工具,为其在各种临床场景中的应用铺平了道路。