【24h】

Two-Dimensional-Reduction Random Forest

机译:二维约简随机森林

获取原文

摘要

Random forest (RF) is a competitive machine learning theorem, while one of the big challenges for it is imbalanced real-world data. A Two-dimensional-reduction RF (2DRRF) is presented in this paper, which is optimized based on traditional RF and three innovation points as follows. To improve RF in terms of performance on imbalanced data, a two-dimensional-reduction approach is created. Then, a modified T-link is proposed focusing on detecting and reducing safe samples. Moreover, a biased sampling manner is employed to build up optimal training datasets. Across 13 imbalanced datasets from KEEL-dataset with imbalance-ratio ranging from 6.38 to 129.44, experiments are carried out indicating that 2DRRF steadily holds advantages over the other two relevant implementations of RF in terms of accuracy, recall, precision and F-value.
机译:随机森林(RF)是一种竞争性的机器学习定理,而其最大的挑战之一是真实世界数据的不平衡。本文提出了一种二维降维RF(2DRRF),它是在传统RF和以下三个创新点的基础上进行了优化的。为了提高射频在不平衡数据上的性能,创建了二维缩减方法。然后,提出了一种改进的T-link,其重点是检测和减少安全样本。此外,采用有偏抽样的方式来建立最佳训练数据集。在KEEL数据集的13个失衡数据集中,失衡比率范围为6.38至129.44,实验表明2DRRF在准确性,查全率,精确度和F值方面稳步保持优于RF的其他两个相关实现的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号