首页> 外文会议>International Conference on Information and Communication Technology Convergence >Oversampling for Imbalanced Data Classification Using Adversarial Network
【24h】

Oversampling for Imbalanced Data Classification Using Adversarial Network

机译:使用对抗网络的不平衡数据分类过采样

获取原文

摘要

The imbalanced data classification problem occurs when the number of samples for one class is much lower than for the other class. In most classification algorithms, the class imbalance is key reason of performance degradation. One way to address the imbalancing issue is to balance them, either by oversampling instances of the minority class or undersampling instances of the majority class. In this paper, we propose an oversampling method for imbalanced data classification using an adversarial network. Firstly, a synthetic minority dataset is generated with a black box oversampler and refined using the refiner network. To bridge a gap between synthetic and real dataset, we train the refiner network using an adversarial loss. The adversarial loss fools a discriminator network that classifies a dataset as real or refined. Experimental results show that the proposed method has high performance comparing with the most common oversampling method.
机译:当一个类别的样本数量比另一类别的样本数量少得多时,就会出现不平衡的数据分类问题。在大多数分类算法中,类不平衡是性能下降的主要原因。解决不平衡问题的一种方法是通过对少数群体实例进行过度采样或对多数群体实例进行欠采样来平衡它们。在本文中,我们提出了一种使用对抗网络进行不平衡数据分类的过采样方法。首先,使用黑匣子过采样器生成合成少数数据集,并使用精简器网络进行精炼。为了弥合合成数据集和真实数据集之间的差距,我们使用对抗性损失来训练精炼机网络。对抗性损失使鉴别网络蒙骗,该鉴别网络将数据集归类为真实数据或精化数据。实验结果表明,与最常见的过采样方法相比,该方法具有较高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号