首页> 外文会议>Optimization and Systems Biology >The Imbalanced Problem in Mass-spectrometry Data Analysis
【24h】

The Imbalanced Problem in Mass-spectrometry Data Analysis

机译:质谱数据分析中的不平衡问题

获取原文
获取原文并翻译 | 示例

摘要

In many cases, protein mass-spectrometry data are imbalanced, i.e. the number of positive examples is much less than that of negative ones, which generally degrade the performance of classifiers used for protein recognition. Despite its importance, few works have been conducted to handle this problem. In this paper, we present a new method that utilizes the EasyEnsemble algorithm to cope with the imbalance problem in mass-spectrometry data. Furthermore, two feature selection algorithms, namely PREE (Prediction Risk based feature selection for EasyEnsemble) and PRIEE (Prediction Risk based feature selection for Individuals of EasyEnsemble), are proposed to select informative features and improve the performance of the EasyEnsemble classifier. Experimental results on three mass spectra data sets demonstrate that the proposed methods outperform two existing filter feature selection methods, which prove the effectiveness of the proposed methods.
机译:在许多情况下,蛋白质质谱数据是不平衡的,即,阳性样品的数量远少于阴性样品的数量,这通常会降低用于蛋白质识别的分类器的性能。尽管它很重要,但为解决这个问题所做的工作很少。在本文中,我们提出了一种利用EasyEnsemble算法来解决质谱数据中的不平衡问题的新方法。此外,提出了两种特征选择算法,即PREE(EasyEnsemble的基于预测风险的特征选择)和PRIEE(EasyEnsemble的个人基于预测风险的特征选择),以选择有用的特征并改善EasyEnsemble分类器的性能。在三个质谱数据集上的实验结果表明,所提方法优于两种现有的滤波器特征选择方法,证明了所提方法的有效性。

著录项

  • 来源
    《Optimization and Systems Biology》|2008年|136-143|共8页
  • 会议地点 Lijiang(CN);Lijiang(CN)
  • 作者单位

    Hao-Hua Meng@School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China--Guo-Zheng Li@Department of Control Science and Engineering, Tongji University, Shanghai 201804, China--Rui-Sheng Wang@School of Information, Renmin University of China, Beijing 100872, China--Xing-Ming Zhao@Institute of System Biology, Shanghai University, Shanghai 200444, China--Luonan Chen@Institute of System Biology, Shanghai University, Shanghai 200444, China--;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物工程学(生物技术);
  • 关键词

    Mass-spectrometry; Feature selection; Ensemble;

    机译:质谱;特征选择;集合;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号