Penalized KLIC Model Selection for the Generalized Method of Moments in Longitudinal Data with Time-Dependent Covariates

Model selection plays an important role in longitudinal data analysis, especially when models are estimated using the generalized method of moments (GMM) in the presence of time-dependent covariates. In this setting, the number of valid moment conditions can grow quickly and may lead to over-parameterized models. The Kullback--Leibler Information Criterion (KLIC) has been proposed as a model-selection tool for this framework; however, the original KLIC criterion may favor overly complex models when the number of parameters or valid moment conditions increases. To address this limitation, this study proposes two penalized versions of KLIC that incorporate penalties based on both the number of model parameters and the number of valid moment conditions. The proposed criteria are referred to as the Moment--Parameter Product Penalty KLIC (MPPP--KLIC) and the Logarithmic Penalty KLIC (LP--KLIC). These criteria provide a theoretically motivated mechanism for balancing model fit and model complexity in GMM-based longitudinal models. Through an extensive simulation study involving both binary and continuous response settings, the proposed criteria are shown to improve the ability of KLIC to distinguish among competing models and to reduce the selection of over-parameterized models. The performance of the proposed methods is further illustrated using the Filipino Child Morbidity dataset, a longitudinal study of child health in the Philippines. The results show that the proposed penalized criteria provide stable and interpretable model rankings and consistently identify age as the most important predictor of child morbidity. Overall, the proposed penalized KLIC criteria offer practical and theoretically grounded tools for model selection in GMM-based longitudinal data analysis with time-dependent covariates.

翻译：模型选择在纵向数据分析中具有重要作用，尤其当模型采用广义矩方法（GMM）估计且存在时变协变量时。在此框架下，有效矩条件的数量可能快速增长，导致过度参数化模型。Kullback-Leibler信息准则（KLIC）已被提出作为该场景下的模型选择工具；然而，当参数或有效矩条件数量增加时，原始KLIC准则可能偏向过于复杂的模型。为解决这一局限，本研究提出两种带罚项的KLIC版本，其罚项基于模型参数数量与有效矩条件数量构建。所提准则分别称为矩-参数乘积罚项KLIC（MPPP-KLIC）和对数罚项KLIC（LP-KLIC）。这些准则为基于GMM的纵向模型在拟合优度与复杂度之间提供了理论驱动的平衡机制。通过涵盖二元与连续响应场景的广泛模拟研究，证明所提准则能提升KLIC区分竞争模型的能力，并减少对过度参数化模型的选择。进一步利用菲律宾儿童发病率数据集（一项关于菲律宾儿童健康的纵向研究）展示了所提方法的性能。结果表明，所提带罚项准则能提供稳定且可解释的模型排序，并一致识别出年龄是儿童发病率的最重要预测因子。总体而言，所提带罚项KLIC准则为含时变协变量的GMM纵向数据分析中的模型选择提供了兼具实用性与理论依据的工具。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【ICML2025】大语言模型中有效长上下文建模的长短对齐方法

专知会员服务

13+阅读 · 2025年6月16日

参数高效微调方法有哪些？岭大等最新《预训练语言模型的参数高效微调》综述，

专知会员服务

70+阅读 · 2023年12月21日

RAG+LLM=？同济大学等最新《大型语言模型的检索增强生成》综述

专知会员服务

111+阅读 · 2023年12月19日

大模型如何利用数据？北大华为等最新《大型语言模型的数据管理》综述

专知会员服务

99+阅读 · 2023年12月6日