...
首页> 外文期刊>Information Theory, IEEE Transactions on >Outlier-Robust PCA: The High-Dimensional Case
【24h】

Outlier-Robust PCA: The High-Dimensional Case

机译:异常强健的PCA:高维案例

获取原文
获取原文并翻译 | 示例
           

摘要

Principal component analysis plays a central role in statistics, engineering, and science. Because of the prevalence of corrupted data in real-world applications, much research has focused on developing robust algorithms. Perhaps surprisingly, these algorithms are unequipped—indeed, unable—to deal with outliers in the high-dimensional setting where the number of observations is of the same magnitude as the number of variables of each observation, and the dataset contains some (arbitrarily) corrupted observations. We propose a high-dimensional robust principal component analysis algorithm that is efficient, robust to contaminated points, and easily kernelizable. In particular, our algorithm achieves maximal robustness—it has a breakdown point of 50% (the best possible), while all existing algorithms have a breakdown point of zero. Moreover, our algorithm recovers the optimal solution exactly in the case where the number of corrupted points grows sublinearly in the dimension.
机译:主成分分析在统计,工程和科学中起着核心作用。由于在实际应用中普遍存在损坏的数据,因此许多研究都集中在开发健壮的算法上。也许令人惊讶的是,这些算法并没有配备(实际上是无法)在高维设置中处理异常值,在高维设置中,观察值的数量与每个观察值的变量的数量相同,并且数据集包含一些(任意)损坏的观察。我们提出了一种高维的,鲁棒的主成分分析算法,该算法高效,对污染点具有鲁棒性并且易于内核化。特别是,我们的算法实现了最大的鲁棒性-击穿点为50%(最佳),而所有现有算法的击穿点为零。此外,在损坏点的数量在维度上呈亚线性增长的情况下,我们的算法可以准确地恢复最优解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号