...
首页> 外文期刊>Quality Control and Applied Statistics >Scale-invariant sparse PCA on high-dimensional meta-elliptical data
【24h】

Scale-invariant sparse PCA on high-dimensional meta-elliptical data

机译:高维元椭圆数据上的尺度不变稀疏PCA

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose: To propose a semi-parametric method for conducting scale-invariant sparse principal component analysis (PCA) on high-dimensional non-Caussian data, that is has a weaker modeling assumption and is more robust to possible data contamination compared with sparse PCA. Summary: PCA is an efficient approach for dimension reduction and feature selection. The dimension that is small compared to the sample size can be estimated by eigenvectors of a sample covariance matrix. However when the dimension increases compared to the sample size this method produces poor estimates. To overcome this one approach used is to use sparsity constraints on eigenvectors. Forthis purpose different variants of sparse PCA has been developed and their theoretical properties have been investigated. The drawbacks of PCA and sparse PCA are: They are not scale invariant and are affected when measurement scales are changed, They are not robust to data contaminations or outliers, and They rely on Gaussian or sub-Caussian assumption, which may not be true.
机译:目的:提出一种半参数方法,用于对高维非高斯数据进行尺度不变的稀疏主成分分析(PCA),与稀疏PCA相比,该方法具有较弱的建模假设,并且对可能的数据污染具有更强的鲁棒性。简介:PCA是减少尺寸和选择特征的有效方法。与样本大小相比较小的维度可以通过样本协方差矩阵的特征向量来估计。但是,当维数与样本大小相比增加时,此方法将产生较差的估计。为了克服这一问题,使用的一种方法是对特征向量使用稀疏约束。为此,已经开发了稀疏PCA的不同变体,并研究了它们的理论特性。 PCA和稀疏PCA的缺点是:它们不是尺度不变的,并且在更改测量尺度时会受到影响;它们对数据污染或离群值不具有鲁棒性;并且它们依赖于高斯或亚高斯假设,这可能不是正确的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号