Clustering multivariate data using factor analytic Bayesian mixtures with an unknown number of components

Papastamoulis Panagiotis


Abstract

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting mixture models yields a simple and efficient way to estimate the unknown number of clusters and model parameters by Markov chain Monte Carlo sampling. The present study extends this approach by considering a set of eight parameterizations, giving rise to parsimonious representations of the covariance matrix per cluster. A Gibbs sampler combined with a prior parallel tempering scheme is implemented in order to approximately sample from the posterior distribution of the overfitting mixture. The parameterization and number of factors are selected according to the Bayesian information criterion. Identifiability issues related to label switching are dealt with by post-processing the simulated output with the Equivalence Classes Representatives algorithm. The contributed method and software are demonstrated and compared to similar models estimated using the expectation-maximization algorithm on simulated and real datasets. The software is available online at .
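As a rough illustration of how an overfitting mixture can yield an estimate of the number of clusters, the sketch below counts the non-empty ("alive") components in each MCMC draw of the allocation vector and reports the most frequent count. This is a minimal sketch, not the paper's software: the function name `estimate_num_clusters`, the toy draws, and the tie-breaking rule are assumptions made for illustration only.

```python
from collections import Counter


def estimate_num_clusters(allocation_draws):
    """Most frequent number of non-empty components across MCMC draws.

    allocation_draws: iterable of allocation vectors, one per MCMC
    iteration; each vector holds the component label of every observation.
    (Hypothetical helper, not part of any published package.)
    """
    # Number of distinct labels actually used in each draw.
    alive_counts = [len(set(z)) for z in allocation_draws]
    freq = Counter(alive_counts)
    best = max(freq.values())
    # Assumed tie-break: prefer the smaller number of clusters.
    return min(k for k, c in freq.items() if c == best)


# Toy draws for 6 observations from a mixture overfitted with many
# components; only a few labels are ever occupied.
draws = [
    [0, 0, 3, 3, 7, 7],
    [0, 0, 3, 3, 7, 7],
    [0, 0, 3, 3, 3, 7],
    [0, 0, 3, 3, 3, 3],
]
print(estimate_num_clusters(draws))
```

Counting distinct labels is invariant to label switching, so this summary can be computed before any relabelling step; cluster-specific parameter summaries would still need the post-processing the abstract describes.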


Bibliographic record

Source: Statistics and Computing, 2020, issue 3, pp. 485-506 (22 pages)
Author: Papastamoulis Panagiotis
Affiliation: Athens Univ Econom Dept Stat Business 76 Patiss St GR-10434 Athens Greece
Original format: PDF
Language: English
Keywords: Mixture model; Factor analysis; MCMC; R package

Similar literature

1. Bayesian multivariate Poisson mixtures with an unknown number of components [J]. Loukia Meligkotsidou. Statistics and computing. 2007, issue 2
2. Overfitting Bayesian mixtures of factor analyzers with an unknown number of components [J]. Papastamoulis Panagiotis. Computational statistics & data analysis. 2018
3. A new R package for Bayesian estimation of multivariate normal mixtures allowing for selection of the number of components and interval-censored data [J]. Komarek A. Computational statistics & data analysis. 2009, issue 12
4. BAYESIAN ESTIMATION OF MIXTURES OF SKEWED ALPHA STABLE DISTRIBUTIONS WITH AN UNKNOWN NUMBER OF COMPONENTS [C]. D. Salas-Gonzalez, E. E. Kuruoglu, D. P. Ruiz. European Signal Processing Conference; EUSIPCO. 2006
5. Bayesian Growth Mixture Model for Clustering Longitudinal Data [D]. Lu, Zihang. 2020
6. A multivariate Poisson-log normal mixture model for clustering transcriptome sequencing data [O]. Anjali Silva, Steven J. Rothstein, Paul D. McNicholas. 2019
7. Overfitting Bayesian Mixtures of Factor Analyzers with an Unknown Number of Components [O]. Papastamoulis, Panagiotis. 2017
8. Determining the Number of Component Clusters in the Standard Multivariate Normal Mixture Model Using Model-Selection Criteria [R]. Bozdogan, H. 1983
