首页> 外文会议>International Conference on Advances in Intelligent Computing(ICIC 2005); 20050823-26; Hefei(CN) >Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning
【24h】

Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning

机译:Borderline-SMOTE:不平衡数据集学习中的一种新的过采样方法

获取原文
获取原文并翻译 | 示例

摘要

In recent years, mining with imbalanced data sets receives more and more attentions in both theoretical and practical aspects. This paper introduces the importance of imbalanced data sets and their broad application domains in data mining, and then summarizes the evaluation metrics and the existing methods to evaluate and solve the imbalance problem. Synthetic minority over-sampling technique (SMOTE) is one of the over-sampling methods addressing this problem. Based on SMOTE method, this paper presents two new minority over-sampling methods, borderline-SMOTE1 and borderline-SMOTE2, in which only the minority examples near the borderline are over-sampled. For the minority class, experiments show that our approaches achieve better TP rate and F-value than SMOTE and random over-sampling methods.
机译:近年来,使用不平衡数据集进行挖掘在理论和实践方面都受到越来越多的关注。本文介绍了不平衡数据集及其在数据挖掘中的广泛应用领域的重要性,然后总结了评估指标和评估和解决不平衡问题的现有方法。合成少数派过采样技术(SMOTE)是解决此问题的一种过采样方法。基于SMOTE方法,本文提出了两种新的少数群体过采样方法,borderline-SMOTE1和borderline-SMOTE2,其中仅对边界附近的少数群体样本进行过采样。对于少数群体,实验表明,与SMOTE和随机过采样方法相比,我们的方法可实现更好的TP速率和F值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号