薄冰之上：通过归因与扰动实现可解释的保护监测 (On Thin Ice: Towards Explainable Conservation Monitoring via Attribution and Perturbations)

Jiayi Zhou,Günel Aghakishiyeva,Saagar Arya,Julian Dale,James David Poling,Holly R. Houliston,Jamie N. Womble,Gregory D. Larsen,David W. Johnston,Brinnae Bent

from arxiv, NeurIPS Imageomics Workshop 2025

Computer vision can accelerate ecological research and conservation monitoring, yet adoption in ecology lags in part because of a lack of trust in black-box neural-network-based models. We seek to address this challenge by applying post-hoc explanations to provide evidence for predictions and document limitations that are important to field deployment. Using aerial imagery from Glacier Bay National Park, we train a Faster R-CNN to detect pinnipeds (harbor seals) and generate explanations via gradient-based class activation mapping (HiResCAM, LayerCAM), local interpretable model-agnostic explanations (LIME), and perturbation-based explanations. We assess explanations along three axes relevant to field use: (i) localization fidelity: whether high-attribution regions coincide with the animal rather than background context; (ii) faithfulness: whether deletion/insertion tests produce changes in detector confidence; and (iii) diagnostic utility: whether explanations reveal systematic failure modes. Explanations concentrate on seal torsos and contours rather than surrounding ice/rock, and removal of the seals reduces detection confidence, providing model-evidence for true positives. The analysis also uncovers recurrent error sources, including confusion between seals and black ice and rocks. We translate these findings into actionable next steps for model development, including more targeted data curation and augmentation. By pairing object detection with post-hoc explainability, we can move beyond "black-box" predictions toward auditable, decision-supporting tools for conservation monitoring.

翻译：计算机视觉能够加速生态学研究和保护监测，然而其在生态学领域的应用仍显滞后，部分原因在于对基于黑盒神经网络模型的信任缺失。我们试图通过应用事后解释方法来解决这一挑战，为预测提供证据并记录对实地部署至关重要的局限性。利用冰川湾国家公园的航拍图像，我们训练了一个Faster R-CNN模型来检测鳍足类动物（斑海豹），并通过基于梯度的类激活映射（HiResCAM、LayerCAM）、局部可解释模型无关解释（LIME）以及基于扰动的解释方法生成解释。我们从三个与实地应用相关的维度评估这些解释：（一）定位保真度：高归因区域是否与动物本身而非背景环境重合；（二）忠实度：删除/插入测试是否会导致检测器置信度的变化；（三）诊断效用：解释是否揭示了系统性的失效模式。解释结果集中于海豹躯干和轮廓而非周围的冰层/岩石，移除海豹区域会降低检测置信度，从而为真阳性预测提供了模型证据。分析还揭示了重复出现的误差来源，包括海豹与黑冰及岩石之间的混淆。我们将这些发现转化为模型开发的可操作后续步骤，包括更具针对性的数据筛选与增强。通过将目标检测与事后可解释性相结合，我们能够超越“黑盒”预测，迈向可用于审计、支持保护监测决策的工具。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日