Efficient Computation of Normalized Maximum Likelihood Codes for Gaussian Mixture Models With Its Applications to Clustering

Hirai S.; Yamanishi K.

Abstract

This paper addresses the problem of estimating the number of mixture components of a Gaussian mixture model (GMM) from a given data sequence. Our approach is to compute the normalized maximum likelihood (NML) code length for the data sequence relative to a GMM, and then to select the mixture size that minimizes the NML code length, following the minimum description length principle. For finite domains, Kontkanen and Myllymäki proposed a method for efficient computation of the NML code length for specific models; for general classes over infinite domains, however, it has remained open how to compute the NML code length efficiently. We first propose a general method for calculating the NML code length for a general exponential family. Then, we apply it to the efficient computation of the NML code length for a GMM. The key idea is to restrict the data domain in combination with the technique of employing a generating function for computing the normalization term for a GMM. We use artificial datasets to empirically demonstrate that our estimate of the mixture size converges to the true one significantly faster than other criteria.
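
To make the finite-domain case mentioned in the abstract concrete, here is a minimal Python sketch. It covers the multinomial model only and is not the GMM method of this paper. It assumes the linear-time recurrence for the multinomial NML normalizer usually credited to Kontkanen and Myllymäki, C(n, k+2) = C(n, k+1) + (n/k) C(n, k). The function names `multinomial_nml_normalizer` and `nml_code_length` are hypothetical and chosen for illustration.

```python
import math


def multinomial_nml_normalizer(n: int, k: int) -> float:
    """Normalizer C(n, k): the sum, over all length-n sequences on k symbols,
    of the maximized multinomial likelihood of each sequence.

    Illustrative only: uses plain floats, so it suits small n.
    """
    if n < 0 or k < 1:
        raise ValueError("need n >= 0 and k >= 1")
    if n == 0:
        return 1.0  # only the empty sequence, with likelihood 1
    c_prev = 1.0  # C(n, 1): a single symbol always has likelihood 1
    if k == 1:
        return c_prev
    # C(n, 2) by direct summation over the count h of the first symbol.
    c_curr = sum(
        math.comb(n, h) * (h / n) ** h * ((n - h) / n) ** (n - h)
        for h in range(n + 1)
    )
    # Assumed recurrence: C(n, j + 2) = C(n, j + 1) + (n / j) * C(n, j).
    for j in range(1, k - 1):
        c_prev, c_curr = c_curr, c_curr + (n / j) * c_prev
    return c_curr


def nml_code_length(counts: list[int]) -> float:
    """NML code length in nats for a sequence summarized by symbol counts:
    negative maximized log-likelihood plus log of the normalizer."""
    n = sum(counts)
    k = len(counts)
    log_ml = sum(c * math.log(c / n) for c in counts if c > 0)
    return -log_ml + math.log(multinomial_nml_normalizer(n, k))


if __name__ == "__main__":
    print(multinomial_nml_normalizer(2, 3))  # 4.5
    print(nml_code_length([3, 1, 0]))
```

Under the minimum description length principle, one would compute such a code length for each candidate model size and keep the size with the smallest value. For a GMM the data domain is infinite, which is why the paper restricts the domain before computing the normalization term.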

Bibliographic details

Source: IEEE Transactions on Information Theory, 2013, no. 11, pp. 7718-7727 (10 pages)
Authors: Hirai S.; Yamanishi K.
Affiliation: Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
Original format: PDF
Language: English
Keywords: Clustering; minimum description length (MDL) principle; normalized maximum likelihood (NML)

Similar literature

1. Correction to Efficient Computation of Normalized Maximum Likelihood Codes for Gaussian Mixture Models With Its Applications to Clustering [J]. Hirai So, Yamanishi Kenji. IEEE Transactions on Information Theory. 2019, no. 10
2. On the Nonparametric Maximum Likelihood Estimator for Gaussian Location Mixture Densities With Application to Gaussian Denoising [J]. The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics. 2020, no. 2
3. Efficient computation of normalized maximum likelihood coding for Gaussian mixtures with its applications to optimal clustering [C]. Hirai So, Yamanishi Kenji. 2011 IEEE International Symposium on Information Theory Proceedings. 2011
4. Maximum likelihood estimation in Gaussian AMP chain graph models and Gaussian ancestral graph models [D]. Drton, Mathias. 2004
5. Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data [O]. Javier Franco-Pedroso, Daniel Ramos, Joaquin Gonzalez-Rodriguez
6. Computationally Efficient Gaussian Maximum Likelihood Methods for Vector ARFIMA Models [O]. Sela Rebecca J., Hurvich Clifford M. 2008
7. MALCOM X: Combining maximum likelihood continuity mapping with Gaussian mixture models [R]. Hogden, J., Scovel, J. C. 1998
