首页> 外文期刊>Journal of applied statistics >Mixtures of general location model with factor analyzer covariance structure for clustering mixed type data
【24h】

Mixtures of general location model with factor analyzer covariance structure for clustering mixed type data

机译:具有因子分析器协方差结构的通用位置模型的混合物,用于对混合类型数据进行聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Cluster analysis is one of the most widely used method in statistical analyses, in which homogeneous subgroups are identified in a heterogeneous population. Due to the existence of the continuous and discrete mixed data in many applications, so far, some ordinary clustering methods such as, hierarchical methods, k-means and model-based methods have been extended for analysis of mixed data. However, in the available model-based clustering methods, by increasing the number of continuous variables, the number of parameters increases and identifying as well as fitting an appropriate model may be difficult. In this paper, to reduce the number of the parameters, for the model-based clustering mixed data of continuous (normal) and nominal data, a set of parsimonious models is introduced. Models in this set are extended, using the general location model approach, for modeling distribution of mixed variables and applying factor analyzer structure for covariance matrices. The ECM algorithm is used for estimating the parameters of these models. In order to show the performance of the proposed models for clustering, results from some simulation studies and analyzing two real data sets are presented.
机译:聚类分析是统计分析中使用最广泛的方法之一,其中在异类种群中鉴定出同质亚组。由于在许多应用中都存在连续和离散的混合数据,到目前为止,一些常规的聚类方法(例如分层方法,k均值和基于模型的方法)已经扩展到混合数据的分析。但是,在可用的基于模型的聚类方法中,通过增加连续变量的数量,参数的数量会增加,识别和拟合合适的模型可能会很困难。为了减少参数数量,针对连续(正常)数据和名义数据的基于模型的聚类混合数据,引入了一组简约模型。使用通用位置模型方法扩展了该集中的模型,以对混合变量的分布进行建模,并为协方差矩阵应用因子分析器结构。 ECM算法用于估计这些模型的参数。为了显示所提出的模型用于聚类的性能,提出了一些仿真研究和分析两个真实数据集的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号