...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >MINIMUM REDUNDANCY FEATURE SELECTION FROM MICROARRAY GENE EXPRESSION DATA
【24h】

MINIMUM REDUNDANCY FEATURE SELECTION FROM MICROARRAY GENE EXPRESSION DATA

机译:从微阵列基因表达数据中选择最小冗余特征

获取原文
获取原文并翻译 | 示例
           

摘要

How to selecting a small subset out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. We propose a minimum redundancy — maximum relevance (MRMR) feature selection framework. Genes selected via MRMR provide a more balanced coverage of the space and capture broader characteristics of phenotypes. They lead to significantly improved class predictions in extensive experiments on 6 gene expression data sets: NCI, Lymphoma, Lung, Child Leukemia, Leukemia, and Colon. Improvements are observed consistently among 4 classification methods: Na?ve Bayes, Linear discriminant analysis, Logistic regression, and Support vector machines.
机译:如何从微阵列数据的数千个基因中选择一小部分,对于表型的准确分类很重要。广泛使用的方法通常根据基因在表型之间的差异表达来对基因进行排名,并选择排名靠前的基因。我们观察到,如此获得的特征集具有一定的冗余性,并研究了将其最小化的方法。我们提出了一种最小冗余—最大相关性(MRMR)功能选择框架。通过MRMR选择的基因可提供更均衡的空间覆盖范围,并捕获更广泛的表型特征。在对6个基因表达数据集的广泛实验中,它们可显着改善班级预测:NCI,淋巴瘤,肺,儿童白血病,白血病和结肠癌。在以下4种分类方法中始终观察到改进:朴素贝叶斯,线性判别分析,Logistic回归和支持向量机。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号