Booster in High Dimensional Data Classification

Kim HyunJi; Choi Byong Su; Huh Moon Yul

首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Booster in High Dimensional Data Classification

【24h】

Booster in High Dimensional Data Classification

机译：高维数据分类的助推器

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Classification problems in high dimensional data with a small number of observations are becoming more common especially in microarray data. During the last two decades, lots of efficient classification models and feature selection (FS) algorithms have been proposed for higher prediction accuracies. However, the result of an FS algorithm based on the prediction accuracy will be unstable over the variations in the training set, especially in high dimensional data. This paper proposes a new evaluation measure Q-statistic that incorporates the stability of the selected feature subset in addition to the prediction accuracy. Then, we propose the Booster of an FS algorithm that boosts the value of the Q-statistic of the algorithm applied. Empirical studies based on synthetic data and 14 microarray data sets show that Booster boosts not only the value of the Q-statistic but also the prediction accuracy of the algorithm applied unless the data set is intrinsically difficult to predict with the given algorithm.

机译：带有少量观察结果的高维数据的分类问题变得越来越普遍，尤其是在微阵列数据中。在过去的二十年中，已经提出了许多有效的分类模型和特征选择（FS）算法，以实现更高的预测精度。但是，基于预测精度的FS算法的结果在训练集（尤其是在高维数据中）的变化范围内将不稳定。本文提出了一种新的评估指标Q统计量，该统计量除了预测精度外还融合了所选特征子集的稳定性。然后，我们提出了一种FS算法的Booster，它可以提高所应用算法的Q统计量的值。基于合成数据和14个微阵列数据集的经验研究表明，Booster不仅可以提高Q统计量的值，而且可以提高所应用算法的预测精度，除非使用给定算法本质上难以预测该数据集。

著录项

来源
《Knowledge and Data Engineering, IEEE Transactions on》 |2016年第1期|29-40|共12页
作者
Kim HyunJi; Choi Byong Su; Huh Moon Yul;
展开▼
作者单位

Korea Fair Trade Mediation Agency, Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Booster; High dimensional data classification; Q-statistic; feature selection; high dimensional data classification; stability;

机译：Booster;高维数据分类;Q统计;特征选择;高维数据分类;稳定性;

相似文献

外文文献
中文文献
专利

1. Direct classification of high-dimensional data in low-dimensional projected feature spaces--comparison of several classification methodologies. [J] . Somorjai RL, Dolenko B, Mandelzweig M Journal of biomedical informatics. . 2007,第2期

机译：在低维投影特征空间中对高维数据进行直接分类-几种分类方法的比较。
2. Revisiting subject classification in academic databases: A comparison of the classification accuracy of Web of Science, Scopus & Dimensions [J] . Singh Prashasti, Piryani Rajesh, Singh Vivek Kumar, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第2Pta2期

机译：在学术数据库中重新审视主题分类：科学网站，Scopus＆尺寸的分类准确性的比较
3. Feature Selection, Mutual Information, And The Classification Of High-dimensional Patternsapplications To Image Classification And Microarray Data Analysis [J] . Boyan Bonev, Francisco Escolano Miguel Cazorla Pattern Analysis and Applications . 2008,第3a4期

机译：特征选择，互信息和高维模式分类在图像分类和微阵列数据分析中的应用
4. DATA MINING BY MOUCLAS: A FUZZY APPROACH TO CLASSIFICATION OVER QUANTITATIVE DATA IN HIGH DIMENSIONAL DATABASE [C] . Yalei Hao, Gerald Quirchmayr, Markus Stumptner International Conference on Fuzzy Information Processing: Theories and Applications vol.1; 20030301-04; Beijing(CN) . 2003

机译：MOUCLAS进行数据挖掘：对高维数据库中的定量数据进行分类的一种模糊方法
5. Data-driven transforms for exploration, visualization and classification of high-dimensional data. [D] . Perez, Dragana Veljkovic. 2010

机译：数据驱动的转换，用于高维数据的探索，可视化和分类。
6. Plurigon: three dimensional visualization and classification of high-dimensionality data [O] . Bronwen Martin, Hongyu Chen, Caitlin M. Daimon, 2013

机译：Plurigon：三维可视化和高维数据分类
7. Direct classification of high-dimensional data in low-dimensional projected feature spaces—Comparison of several classification methodologies [O] . Somorjai R.L., Dolenko B., Mandelzweig M. 2007

机译：在低维投影特征空间中直接对高维数据进行分类—几种分类方法的比较

Booster in High Dimensional Data Classification

摘要

著录项

相似文献

相关主题

期刊订阅