GFM4MPM: Towards Geospatial Foundation Models for Mineral Prospectivity Mapping

Machine Learning (ML) for Mineral Prospectivity Mapping (MPM) remains a challenging problem as it requires the analysis of associations between large-scale multi-modal geospatial data and few historical mineral commodity observations (positive labels). Recent MPM works have explored Deep Learning (DL) as a modeling tool with more representation capacity. However, these overparameterized methods may be more prone to overfitting due to their reliance on scarce labeled data. While a large quantity of unlabeled geospatial data exists, no prior MPM works have considered using such information in a self-supervised manner. Our MPM approach uses a masked image modeling framework to pretrain a backbone neural network in a self-supervised manner using unlabeled geospatial data alone. After pretraining, the backbone network provides feature extraction for downstream MPM tasks. We evaluated our approach alongside existing methods to assess mineral prospectivity of Mississippi Valley Type (MVT) and Clastic-Dominated (CD) Lead-Zinc deposits in North America and Australia. Our results demonstrate that self-supervision promotes robustness in learned features, improving prospectivity predictions. Additionally, we leverage explainable artificial intelligence techniques to demonstrate that individual predictions can be interpreted from a geological perspective.

翻译：矿产资源潜力预测（MPM）中的机器学习（ML）仍然是一个具有挑战性的问题，因为它需要分析大规模多模态地理空间数据与少量历史矿产观测（正样本）之间的关联关系。近期的MPM研究探索了使用具有更强表征能力的深度学习（DL）作为建模工具。然而，这些过参数化方法因其对稀缺标注数据的依赖，可能更容易出现过拟合。尽管存在大量未标注的地理空间数据，但此前尚无MPM研究考虑以自监督的方式利用此类信息。我们的MPM方法采用掩码图像建模框架，仅使用未标注的地理空间数据以自监督方式预训练一个骨干神经网络。预训练完成后，该骨干网络为下游MPM任务提供特征提取功能。我们评估了我们的方法以及现有方法，以评估北美和澳大利亚密西西比河谷型（MVT）和碎屑岩型（CD）铅锌矿床的矿产资源潜力。我们的结果表明，自监督增强了学习特征的鲁棒性，从而改善了潜力预测。此外，我们利用可解释人工智能技术证明，单个预测可以从地质学角度进行解释。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日