Spatial meshing for general Bayesian multivariate models

Quantifying spatial and/or temporal associations in multivariate geolocated data of different types is achievable via spatial random effects in a Bayesian hierarchical model, but severe computational bottlenecks arise when spatial dependence is encoded as a latent Gaussian process (GP) in the increasingly common large scale data settings on which we focus. The scenario worsens in non-Gaussian models because the reduced analytical tractability leads to additional hurdles to computational efficiency. In this article, we introduce Bayesian models of spatially referenced data in which the likelihood or the latent process (or both) are not Gaussian. First, we exploit the advantages of spatial processes built via directed acyclic graphs, in which case the spatial nodes enter the Bayesian hierarchy and lead to posterior sampling via routine Markov chain Monte Carlo (MCMC) methods. Second, motivated by the possible inefficiencies of popular gradient-based sampling approaches in the multivariate contexts on which we focus, we introduce the simplified manifold preconditioner adaptation (SiMPA) algorithm which uses second order information about the target but avoids expensive matrix operations. We demostrate the performance and efficiency improvements of our methods relative to alternatives in extensive synthetic and real world remote sensing and community ecology applications with large scale data at up to hundreds of thousands of spatial locations and up to tens of outcomes. Software for the proposed methods is part of R package 'meshed', available on CRAN.

翻译：在贝叶斯分层模型中，通过空间随机效应可以实现对不同类型多元地理定位数据中空间和/或时间关联的量化，但当空间依赖性被编码为潜在高斯过程时，在我们重点关注的大规模数据场景中会出现严重的计算瓶颈。非高斯模型的情况更为严峻，因为解析可处理性的降低导致计算效率面临额外障碍。本文针对似然函数或潜在过程（或两者）均为非高斯形式的空间参考数据，提出了贝叶斯建模方法。首先，我们利用基于有向无环图构建空间过程的优势，在此框架下空间节点进入贝叶斯分层结构，并通过常规马尔可夫链蒙特卡洛方法实现后验抽样。其次，针对我们关注的多元场景中常用的基于梯度的采样方法可能存在的效率低下问题，我们引入简化流形预处理器自适应算法，该算法利用目标分布的二阶信息，但避免了昂贵的矩阵运算。通过包含数十万空间位置和数十个结果变量的大规模数据合成实验及真实遥感与群落生态学应用案例，我们验证了所提方法相对于替代方案在性能与效率上的提升。本文方法的配套软件已收录于CRAN平台的R语言程序包'meshed'中。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日