首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Optimal Aggregation of Binary Classifiers for Multiclass Cancer Diagnosis Using Gene Expression Profiles
【24h】

Optimal Aggregation of Binary Classifiers for Multiclass Cancer Diagnosis Using Gene Expression Profiles

机译:使用基因表达谱进行二分类器的最佳聚合,用于多类癌症诊断

获取原文
获取原文并翻译 | 示例
           

摘要

Multiclass classification is one of the fundamental tasks in bioinformatics and typically arises in cancer diagnosis studies by gene expression profiling. There have been many studies of aggregating binary classifiers to construct a multiclass classifier based on one-versus-the-rest (1R), one-versus-one (11), or other coding strategies, as well as some comparison studies between them. However, the studies found that the best coding depends on each situation. Therefore, a new problem, which we call the ldquooptimal coding problem,rdquo has arisen: how can we determine which coding is the optimal one in each situation? To approach this optimal coding problem, we propose a novel framework for constructing a multiclass classifier, in which each binary classifier to be aggregated has a weight value to be optimally tuned based on the observed data. Although there is no a priori answer to the optimal coding problem, our weight tuning method can be a consistent answer to the problem. We apply this method to various classification problems including a synthesized data set and some cancer diagnosis data sets from gene expression profiling. The results demonstrate that, in most situations, our method can improve classification accuracy over simple voting heuristics and is better than or comparable to state-of-the-art multiclass predictors.
机译:多类分类是生物信息学中的基本任务之一,通常通过基因表达谱分析在癌症诊断研究中出现。已有许多研究对二进制分类器进行聚合,以基于“一对多休息”(1R),“一对一”(11)或其他编码策略构造一个多类分类器,并进行了一些比较研究。但是,研究发现,最佳编码取决于每种情况。因此,出现了一个称为“最佳编码问题”的新问题:如何确定每种情况下哪种编码是最佳编码?为了解决该最佳编码问题,我们提出了一种用于构造多类分类器的新颖框架,其中要聚合的每个二进制分类器都具有一个权重值,该权重值将根据观察到的数据进行优化。尽管没有最优编码问题的先验答案,但是我们的权重调整方法可以是该问题的一致答案。我们将此方法应用于各种分类问题,包括综合数据集和一些来自基因表达谱的癌症诊断数据集。结果表明,在大多数情况下,我们的方法可以比简单的投票启发法提高分类的准确性,并且优于或可与最新的多类预测器相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号