首页> 外文会议>International conference on omputational science and technology >Ideal Combination Feature Selection Model for Classification Problem based on Bio-Inspired Approach
【24h】

Ideal Combination Feature Selection Model for Classification Problem based on Bio-Inspired Approach

机译:基于生物启发方法的分类问题理想组合特征选择模型

获取原文

摘要

Feature selection or attribute reduction is a crucial process to achieve optimal data reduction for classification task. However, most of the feature selection methods that were introduced work individually that sometimes caused less optimal feature being selected, subsequently degrading the consistency of the classification accuracy rate. The aim of this paper is to exploit the capability of bio-inspired search algorithms, together with wrapper and filtered methods in generating optimal set of features. The important step is to idealize the combined feature selection models by finding the best combination of search method and feature selection algorithms. The next step is to define an optimized feature set for classification task. Performance metrics are analyzed based on classification accuracy and the number of selected features. Experiments were conducted on nine (9) benchmark datasets with various sizes, categorized as small, medium and large dataset. Experimental results revealed that the ideal combination is a feature selection model with the implementation of bio-inspired search algorithm that consistently obtains the optimal solution (i.e. less number of features with higher classification accuracy) on the selected dataset. Such a finding indicates that the exploitation of bio-inspired algorithms with ideal combination of wrapper/filtered method can contribute in finding the optimal features to be used in data mining model construction.
机译:特征选择或属性缩减是实现分类任务的最佳数据减少的重要过程。然而,大多数特征选择方法都是单独推出的,有时会引起更少的最佳特征,随后降低分类精度率的一致性。本文的目的是利用生物启发搜索算法的能力以及包装器和过滤的方法生成最佳特征。重要的步骤是通过找到搜索方法和特征选择算法的最佳组合来理解组合特征选择模型。下一步是为分类任务定义设置的优化功能。根据分类准确性和所选功能的数量分析性能指标。在九(9)个基准数据集中进行实验,具有各种尺寸,分为小,中等和大型数据集。实验结果表明,理想的组合是具有生物启发搜索算法的特征选择模型,该搜索算法一致地在所选数据集上始终获得最佳解决方案(即,具有较高分类精度的特征数量)。这种发现表明,具有理想组合的包装/过滤方法的生物启发算法的利用可以有助于找到数据挖掘模型结构中的最佳特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号