...
首页> 外文期刊>Knowledge and Information Systems >Boosting support vector machines for imbalanced data sets
【24h】

Boosting support vector machines for imbalanced data sets

机译:提升支持向量机的不平衡数据集

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed vector spaces or lack of information. Common approaches for dealing with the class imbalance problem involve modifying the data distribution or modifying the classifier. In this work, we choose to use a combination of both approaches. We use support vector machines with soft margins as the base classifier to solve the skewed vector spaces problem. We then counter the excessive bias introduced by this approach with a boosting algorithm. We found that this ensemble of SVMs makes an impressive improvement in prediction performance, not only for the majority class, but also for the minority class.
机译:现实世界中的数据挖掘应用程序必须解决从不平衡数据集中学习的问题。当一个类中的实例数大大超过另一类中的实例数时,就会出现此问题。由于向量空间偏斜或信息不足,此类数据集通常会导致构建默认分类器。处理类不平衡问题的常用方法包括修改数据分布或修改分类器。在这项工作中,我们选择使用两种方法的组合。我们使用具有软边距的支持向量机作为基础分类器来解决向量空间偏斜的问题。然后,我们使用增强算法来抵消这种方法引入的过度偏差。我们发现,这种支持向量机的集成不仅对多数类别,而且对于少数类别,都在预测性能方面取得了令人印象深刻的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号