首页> 外文期刊>Knowledge-Based Systems >Missing value imputation using decision trees and decision forests by splitting and merging records: Two novel techniques
【24h】

Missing value imputation using decision trees and decision forests by splitting and merging records: Two novel techniques

机译:通过拆分和合并记录,使用决策树和决策森林进行价值插补的缺失:两种新颖的技术

获取原文
获取原文并翻译 | 示例
           

摘要

We present two novel techniques for the imputation of both categorical and numerical missing values. The techniques use decision trees and forests to identify horizontal segments of a data set where the records belonging to a segment have higher similarity and attribute correlations. Using the similarity and correlations, missing values are then imputed. To achieve a higher quality of imputation some segments are merged together using a novel approach. We use nine publicly available data sets to experimentally compare our techniques with a few existing ones in terms of four commonly used evaluation criteria. The experimental results indicate a clear superiority of our techniques based on statistical analyses such as confidence interval.
机译:我们提出两种新颖的技术,用于归类分类和数字缺失值。该技术使用决策树和森林来识别数据集的水平段,其中属于段的记录具有更高的相似度和属性相关性。使用相似性和相关性,然后估算缺失值。为了获得更高的插补质量,可以使用一种新颖的方法将某些段合并在一起。我们使用九个公开可用的数据集,根据四种常用的评估标准对我们的技术与一些现有的数据进行实验比较。实验结果表明,基于统计分析(例如置信区间),我们的技术具有明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号