...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Booster in High Dimensional Data Classification
【24h】

Booster in High Dimensional Data Classification

机译:高维数据分类的助推器

获取原文
获取原文并翻译 | 示例
           

摘要

Classification problems in high dimensional data with a small number of observations are becoming more common especially in microarray data. During the last two decades, lots of efficient classification models and feature selection (FS) algorithms have been proposed for higher prediction accuracies. However, the result of an FS algorithm based on the prediction accuracy will be unstable over the variations in the training set, especially in high dimensional data. This paper proposes a new evaluation measure Q-statistic that incorporates the stability of the selected feature subset in addition to the prediction accuracy. Then, we propose the Booster of an FS algorithm that boosts the value of the Q-statistic of the algorithm applied. Empirical studies based on synthetic data and 14 microarray data sets show that Booster boosts not only the value of the Q-statistic but also the prediction accuracy of the algorithm applied unless the data set is intrinsically difficult to predict with the given algorithm.
机译:带有少量观察结果的高维数据的分类问题变得越来越普遍,尤其是在微阵列数据中。在过去的二十年中,已经提出了许多有效的分类模型和特征选择(FS)算法,以实现更高的预测精度。但是,基于预测精度的FS算法的结果在训练集(尤其是在高维数据中)的变化范围内将不稳定。本文提出了一种新的评估指标Q统计量,该统计量除了预测精度外还融合了所选特征子集的稳定性。然后,我们提出了一种FS算法的Booster,它可以提高所应用算法的Q统计量的值。基于合成数据和14个微阵列数据集的经验研究表明,Booster不仅可以提高Q统计量的值,而且可以提高所应用算法的预测精度,除非使用给定算法本质上难以预测该数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号