首页> 外文期刊>Statistics and computing >Dimension reduction for model-based clustering
【24h】

Dimension reduction for model-based clustering

机译:基于模型的聚类的降维

获取原文
获取原文并翻译 | 示例
           

摘要

We introduce a dimension reduction method for visualizing the clustering structure obtained from a finite mixture of Gaussian densities. Information on the dimension reduction subspace is obtained from the variation on group means and, depending on the estimated mixture model, on the variation on group covariances. The proposed method aims at reducing the dimensionality by identifying a set of linear combinations, ordered by importance as quantified by the associated eigenvalues, of the original features which capture most of the cluster structure contained in the data. Observations may then be projected onto such a reduced subspace, thus providing summary plots which help to visualize the clustering structure. These plots can be particularly appealing in the case of high-dimensional data and noisy structure. The new constructed variables capture most of the clustering information available in the data, and they can be further reduced to improve clustering performance. We illustrate the approach on both simulated and real data sets.
机译:我们介绍了一种降维方法,用于可视化从高斯密度的有限混合中获得的聚类结构。关于维数减少子空间的信息是从组均值的变化以及组估计的混合模型的变化(取决于估计的混合模型)获得的。所提出的方法旨在通过识别一组原始特征的线性组合来降低维数,该线性组合按重要性排序(由相关特征值量化),该原始特征捕获了数据中包含的大多数聚类结构。然后可以将观测值投影到这种缩小的子空间上,从而提供有助于可视化聚类结构的摘要图。这些图在高维数据和嘈杂的结构中可能特别有吸引力。新构造的变量捕获了数据中可用的大多数聚类信息,并且可以进一步减少它们以提高聚类性能。我们说明了在模拟和真实数据集上的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号