Mineral prospectivity mapping requires synthesizing heterogeneous geological knowledge, including textual deposit models and geospatial datasets, to identify regions likely to host specific mineral deposit types. This process is traditionally manual and knowledge-intensive. We present QueryPlot, a semantic retrieval and mapping framework that integrates large-scale geological text corpora with geologic map data using modern Natural Language Processing techniques. We curate descriptive deposit models for over 120 deposit types and transform the State Geologic Map Compilation (SGMC) polygons into structured textual representations. Given a user-defined natural language query, the system encodes both queries and region descriptions using a pretrained embedding model and computes semantic similarity scores to rank and spatially visualize regions as continuous evidence layers. QueryPlot supports compositional querying over deposit characteristics, enabling aggregation of multiple similarity-derived layers for multi-criteria prospectivity analysis. In a case study on tungsten skarn deposits, we demonstrate that embedding-based retrieval achieves high recall of known occurrences and produces prospective regions that closely align with expert-defined permissive tracts. Furthermore, similarity scores can be incorporated as additional features in supervised learning pipelines, yielding measurable improvements in classification performance. QueryPlot is implemented as a web-based system supporting interactive querying, visualization, and export of GIS-compatible prospectivity layers.To support future research, we have made the source code and datasets used in this study publicly available.
翻译:矿产远景制图需要综合异构地质知识(包括文本矿床模型与地理空间数据集)以识别可能赋存特定矿床类型的区域。该过程传统上依赖人工且具有知识密集型特征。本文提出QueryPlot——一种语义检索与制图框架,其利用现代自然语言处理技术将大规模地质文本语料与地质图数据相融合。我们系统整理了120余种矿床类型的描述性模型,并将《州地质图汇编》(SGMC)多边形数据转化为结构化文本表征。针对用户定义的自然语言查询,系统通过预训练嵌入模型对查询语句与区域描述进行编码,计算语义相似度得分以排序区域,并将结果以连续证据层形式进行空间可视化。QueryPlot支持对矿床特征进行组合式查询,能够聚合多个相似度衍生层以开展多准则远景分析。在矽卡岩型钨矿床的案例研究中,我们证明基于嵌入的检索方法能够高召回率地识别已知矿点,且生成的远景区与专家定义的许可带高度吻合。此外,相似度得分可作为监督学习流程的附加特征,从而显著提升分类性能。QueryPlot以后端系统形式实现,支持交互式查询、可视化及GIS兼容远景层的导出。为促进后续研究,本研究涉及的源代码与数据集均已公开。