首页> 外文期刊>Statistics and computing >Mixture of latent trait analyzers for model-based clustering of categorical data
【24h】

Mixture of latent trait analyzers for model-based clustering of categorical data

机译:潜在特征分析器的混合,用于基于模型的分类数据聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Model-based clustering methods for continuous data are well established and commonly used in a wide range of applications. However, model-based clustering methods for categorical data are less standard. Latent class analysis is a commonly used method for model-based clustering of binary data and/or categorical data, but due to an assumed local independence structure there may not be a correspondence between the estimated latent classes and groups in the population of interest. The mixture of latent trait analyzers model extends latent class analysis by assuming a model for the categorical response variables that depends on both a categorical latent class and a continuous latent trait variable; the discrete latent class accommodates group structure and the continuous latent trait accommodates dependence within these groups. Fitting the mixture of latent trait analyzers model is potentially difficult because the likelihood function involves an integral that cannot be evaluated analytically. We develop a variational approach for fitting the mixture of latent trait models and this provides an efficient model fitting strategy. The mixture of latent trait analyzers model is demonstrated on the analysis of data from the National Long Term Care Survey (NLTCS) and voting in the U.S. Congress. The model is shown to yield intuitive clus- tering results and it gives a much better fit than either latent class analysis or latent trait analysis alone.
机译:连续数据的基于模型的聚类方法已得到很好的建立,并广泛用于各种应用中。但是,用于分类数据的基于模型的聚类方法不太标准。潜在类别分析是用于二进制数据和/或分类数据的基于模型的聚类的常用方法,但是由于假定的局部独立性结构,因此潜在人群中估计的潜在类别和群体之间可能没有对应关系。潜在特征分析器的混合模型通过假设类别响应变量的模型扩展了潜在类分析,该模型既取决于类别潜在类又取决于连续潜在特征类变量。离散的潜在类别适应群体结构,连续的潜在特征适应这些群体内的依赖性。拟合潜在特征分析器模型的混合物可能很困难,因为似然函数涉及无法进行分析评估的积分。我们开发了一种变异方法来拟合潜在特征模型的混合,这提供了有效的模型拟合策略。在对美国国家长期护理调查(NLTCS)的数据进行分析并在美国国会进行表决后,证明了潜在特征分析器模型的混合。该模型显示出直观的集群结果,与单独的潜在类分析或潜在特征分析相比,拟合效果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号