Enhancing Tree Species Classification: Insights from YOLOv8 and Explainable AI Applied to TLS Point Cloud Projections

Aiming to advance research in the field of interpretability of deep learning models for tree species classification using TLS 3D point clouds we present insights in the classification abilities of YOLOv8 through a new framework which enables systematic analysis of saliency maps derived from CAM (Class Activation Mapping). To investigate the contribution of structural tree features to the classification decisions of the models, we link regions with high saliency derived from the application of Finer-CAM to segments of 2D side-view images that correspond to structural tree features. Using TLS 3D point clouds from 2445 trees across seven European tree species, we trained five YOLOv8 models with cross-validation, reaching a mean accuracy of 96% (SD = 0.24%) when applied to the test data. Our results demonstrate that Finer-CAM can be considered faithful in identifying discriminative regions that discriminate target tree species. This renders Finer-CAM suitable for enhancing the interpretability of the tree species classification models. Analysis of 630 saliency maps indicate that the models primarily rely on image regions associated with tree crowns for species classification. While this result is pronounced in Silver Birch, European Beech, English oak, and Norway Spruce, image regions associated with stems contribute more frequently to the differentiation of European ash, Scots pine, and Douglas-fir. We demonstrate that the visibility of detailed structural tree features in the 2D side-view images enhances the discriminative performances of the models, indicating YOLOv8`s abilities to leverage detailed point cloud representations. Our results represent a first step toward enhancing the understanding of the classification decision processes of tree species classification models, aiding in the identification of data set and model limitations, and building confidence in model predictions.

翻译：为推进基于TLS三维点云的深度学习模型在树种分类可解释性领域的研究，本文通过构建新型分析框架，系统解析源自CAM（类别激活映射）的显著性图谱，从而揭示YOLOv8模型的分类能力。为探究树木结构特征对模型分类决策的贡献度，我们将Finer-CAM生成的高显著性区域与对应树木结构特征的二维侧视图像片段进行关联分析。基于覆盖七种欧洲树种的2445棵树木的TLS三维点云数据，我们通过交叉验证训练了五个YOLOv8模型，在测试数据上达到平均96%（标准差=0.24%）的分类准确率。实验结果表明，Finer-CAM能可靠识别区分目标树种的关键判别区域，这使其适用于增强树种分类模型的可解释性。对630幅显著性图谱的分析表明，模型主要依赖树冠关联图像区域进行树种分类。该特征在银桦、欧洲山毛榉、英国栎和挪威云杉中表现显著，而欧洲白蜡树、欧洲赤松和花旗松的判别则更频繁地依赖于树干关联图像区域。我们证明二维侧视图像中树木细节结构特征的可见性能够提升模型的判别性能，这表明YOLOv8具备利用精细点云表征的能力。本研究为深入理解树种分类模型的决策机制迈出关键一步，有助于识别数据集与模型的局限性，并为提升模型预测的可信度奠定基础。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

《基于随机森林的Q学习算法：迈向可解释人工智能》最新90页

专知会员服务

32+阅读 · 2025年3月15日

YOLOv1 到 YOLOv10：最快且最准确的实时目标检测系统

专知会员服务

42+阅读 · 2024年8月22日

人工智能（XAI）可解释性的研究进展！

专知会员服务

34+阅读 · 2024年6月12日