零膨胀相关牙科数据的建模：基于高斯Copula与近似贝叶斯计算 (Modeling Zero-Inflated Correlated Dental Data through Gaussian Copulas and Approximate Bayesian Computation)

We develop a new longitudinal count data regression model that accounts for zero-inflation and spatio-temporal correlation across responses. This project is motivated by an analysis of Iowa Fluoride Study (IFS) data, a longitudinal cohort study with data on caries (cavity) experience scores measured for each tooth across five time points. To that end, we use a hurdle model for zero-inflation with two parts: the presence model indicating whether a count is non-zero through logistic regression and the severity model that considers the non-zero counts through a shifted Negative Binomial distribution allowing overdispersion. To incorporate dependence across measurement occasion and teeth, these marginal models are embedded within a Gaussian copula that introduces spatio-temporal correlations. A distinct advantage of this formulation is that it allows us to determine covariate effects with population-level (marginal) interpretations in contrast to mixed model choices. Standard Bayesian sampling from such a model is infeasible, so we use approximate Bayesian computing for inference. This approach is applied to the IFS data to gain insight into the risk factors for dental caries and the correlation structure across teeth and time.

翻译：本文提出了一种新的纵向计数数据回归模型，该模型能够同时处理零膨胀现象以及响应变量间的时空相关性。本研究的动机源于对爱荷华州氟化物研究数据的分析，该纵向队列研究记录了每颗牙齿在五个时间点上的龋齿（蛀牙）经历评分。为此，我们采用跨栏模型处理零膨胀问题，该模型包含两部分：通过逻辑回归判断计数是否非零的“存在模型”，以及通过允许过度离散的平移负二项分布处理非零计数的“严重程度模型”。为了纳入测量时点间与牙齿间的依赖性，我们将这些边际模型嵌入到能够引入时空相关性的高斯Copula框架中。该构建方式的一个显著优势在于，相较于混合模型，它允许我们在总体水平（边际）上解释协变量的效应。由于从该模型进行标准贝叶斯采样不可行，我们采用近似贝叶斯计算进行推断。将此方法应用于IFS数据，有助于深入理解龋齿的风险因素以及牙齿间和跨时间的相关性结构。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日