Boosting support vector machines for imbalanced data sets

Benjamin X. Wang; Nathalie Japkowicz

Abstract

Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed vector spaces or lack of information. Common approaches for dealing with the class imbalance problem involve modifying the data distribution or modifying the classifier. In this work, we choose to use a combination of both approaches. We use support vector machines with soft margins as the base classifier to solve the skewed vector spaces problem. We then counter the excessive bias introduced by this approach with a boosting algorithm. We found that this ensemble of SVMs makes an impressive improvement in prediction performance, not only for the majority class, but also for the minority class.
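
The combination described in the abstract, a soft-margin SVM as base classifier inside a boosting loop, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic AdaBoost-style reweighting scheme, a linear SVM fitted by batch subgradient descent on a weighted hinge loss, and a small hand-made imbalanced data set. All function names, hyperparameters (`lam`, `lr`, `epochs`, `rounds`) and data points are hypothetical choices made for illustration only.

```python
import math

def svm_score(w, b, x):
    return sum(wj * xj for wj, xj in zip(w, x)) + b

def predict_one(w, b, x):
    return 1 if svm_score(w, b, x) >= 0 else -1

def train_linear_svm(X, y, dist, lam=0.01, lr=0.05, epochs=500):
    """Soft-margin linear SVM (assumed base learner): minimise
    lam/2 * ||w||^2 + sum_i dist[i] * hinge(y_i * (w.x_i + b))
    by batch subgradient descent."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi, di in zip(X, y, dist):
            if yi * svm_score(w, b, xi) < 1:  # margin violated
                for j, xj in enumerate(xi):
                    grad_w[j] -= di * yi * xj
                grad_b -= di * yi
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def boost_svms(X, y, rounds=5):
    """AdaBoost-style loop (an assumption, not the paper's exact scheme)."""
    n = len(X)
    dist = [1.0 / n] * n  # start from uniform instance weights
    ensemble = []
    for _ in range(rounds):
        w, b = train_linear_svm(X, y, dist)
        preds = [predict_one(w, b, xi) for xi in X]
        err = sum(di for di, p, yi in zip(dist, preds, y) if p != yi)
        if err >= 0.5:  # base learner no better than chance: stop
            break
        alpha = 0.5 * math.log((1 - err) / max(err, 1e-10))
        ensemble.append((alpha, w, b))
        if err == 0:  # nothing left to reweight
            break
        # Raise the weight of misclassified instances, lower the rest.
        dist = [di * math.exp(-alpha * yi * p)
                for di, p, yi in zip(dist, preds, y)]
        total = sum(dist)
        dist = [di / total for di in dist]
    return ensemble

def predict(ensemble, x):
    vote = sum(alpha * predict_one(w, b, x) for alpha, w, b in ensemble)
    return 1 if vote >= 0 else -1

# Hypothetical toy data: 36 majority points (label -1), 5 minority points (+1).
majority = [(0.2 * i, 0.2 * j) for i in range(6) for j in range(6)]
minority = [(4.0, 4.0), (4.5, 4.0), (4.0, 4.5), (4.5, 4.5), (0.5, 0.5)]
X = majority + minority
y = [-1] * len(majority) + [1] * len(minority)

ensemble = boost_svms(X, y)
train_preds = [predict(ensemble, xi) for xi in X]
accuracy = sum(p == yi for p, yi in zip(train_preds, y)) / len(y)
print(len(ensemble), "SVMs in the ensemble, training accuracy", round(accuracy, 3))
```

The minority point placed inside the majority region is there so that a single linear SVM cannot fit the data, which gives the reweighting step something to do. In practice a library SVM that accepts per-instance weights would replace the hand-written trainer.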

Bibliographic details

Source
Knowledge and Information Systems | 2010, Issue 1 | pp. 1-20 | 20 pages
Authors
Benjamin X. Wang; Nathalie Japkowicz
Original format: PDF
Language: English
Keywords
Imbalanced data sets; Support vector machines; Boosting

Similar literature

1. Boosting support vector machines for imbalanced data sets [J]. Benjamin X. Wang, Nathalie Japkowicz. Knowledge and Information Systems. 2010, Issue 1.
2. Boosting Support Vector Machines for Imbalanced Microarray Data [J]. Risky Frasetio Wahyu Pratama, Santi Wulan Purnami, Santi Puteri Rahayu. Procedia Computer Science. 2018, Issue 22.
3. Affinity and class probability-based fuzzy support vector machine for imbalanced data sets [J]. Tao Xinmin, Li Qing, Ren Chao. Neural Networks: The Official Journal of the International Neural Network Society. 2020.
4. Boosting Support Vector Machines for Imbalanced Data Sets [C]. Benjamin X. Wang, Nathalie Japkowicz. Foundations of Intelligent Systems. 2008.
5. A selective sampling method for imbalanced data learning on support vector machines [D]. Choi, Jong Myong. 2010.
6. Enhancement of hepatitis virus immunoassay outcome predictions in imbalanced routine pathology data by data balancing and feature selection before the application of support vector machines [O]. Alice M. Richardson, Brett A. Lidbury. 2017.
7. Boosting Support Vector Machines for Imbalanced Data Sets [O]. Benjamin X. Wang, Nathalie Japkowicz. 2010.
