Gaussian Mixture Model with unknown diagonal covariances via continuous sparse regularization

This paper addresses the statistical estimation of Gaussian Mixture Models (GMMs) with unknown diagonal covariances from independent and identically distributed samples. We employ the Beurling-LASSO (BLASSO), a convex optimization framework that promotes sparsity in the space of measures, to simultaneously estimate the number of components and their parameters. Our main contribution extends the BLASSO methodology to multivariate GMMs with component-specific unknown diagonal covariance matrices. This setting is significantly more flexible than previous approaches, which required known and identical covariances. We establish non-asymptotic recovery guarantees with nearly parametric convergence rates for component means, diagonal covariances, and weights, as well as for density prediction. A key theoretical contribution is the identification of an explicit separation condition on mixture components that enables the construction of non-degenerate dual certificates-essential tools for establishing statistical guarantees for the BLASSO. Our analysis leverages the Fisher-Rao geometry of the statistical model and introduces a novel semi-distance adapted to our framework, providing new insights into the interplay between component separation, parameter space geometry, and achievable statistical recovery.

翻译：本文研究了在独立同分布样本下，对具有未知对角协方差的高斯混合模型进行统计估计的问题。我们采用Beurling-LASSO（BLASSO）这一凸优化框架——该框架可促进测度空间中的稀疏性——来同时估计混合成分的数量及其参数。本文的主要贡献在于将BLASSO方法扩展至具有成分特异性未知对角协方差矩阵的多变量GMM。与先前要求协方差已知且恒定的方法相比，本设置显著提升了灵活性。我们建立了非渐近恢复保证，使得成分均值、对角协方差、权重以及密度预测均能达到近乎参数的收敛速率。一项关键的理论贡献是识别出混合成分间显式的分离条件，该条件能够构造非退化对偶证书——这是为BLASSO建立统计保证的重要工具。我们的分析利用了统计模型的Fisher-Rao几何结构，并引入了一种适应本文框架的新型半距离，为理解成分分离度、参数空间几何结构与可实现的统计恢复之间的相互作用提供了新见解。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

稀疏混合专家模型 (SMoE) 的崛起：从算法基础、去中心化架构到垂直领域应用的综述

专知会员服务

17+阅读 · 2月12日

混合专家模型简述

专知会员服务

18+阅读 · 2025年5月30日

AAAI 25 | 融合分隔：协同专家混合模型用于数据稀缺环境下的药物-靶点相互作用预测

专知会员服务

12+阅读 · 2025年1月13日

【牛津大学博士论文】多模态概率推理的机器学习预测与协调，173页pdf

专知会员服务

87+阅读 · 2022年10月16日