首页> 外文会议>International Conference on Database and Expert Systems Applications >Feature Selection Using Association Word Mining for Classification
【24h】

Feature Selection Using Association Word Mining for Classification

机译:使用关联词挖掘分类的功能选择

获取原文

摘要

In this paper, we propose effective feature selection method using association word mining. Documents are represented as association-word-vectors that include a few words instead of single words. The focus in this paper is the association rule in reduction of a high dimensional feature space. The accuracy and recall of document classification depend on the number of words for composing association words, confidence, and support at Apriori algorithm. We show how confidence, support, and the number of words for composing association words at Apriori algorithm are selected efficiently. We have used Naive Bayes classifier on text data using proposed feature-vector document representation. By experiment for categorizing documents, we have proved that feature selection method of association word mining is more efficient than information gain and document frequency.
机译:在本文中,我们提出了使用关联词挖掘的有效特征选择方法。文档表示为关联 - 字向量,包括几个单词而不是单个单词。本文的重点是减少高维特征空间的关联规则。文档分类的准确性和召回取决于组合协作词,信心和支持以APRIORI算法的单词数。我们展示了如何有效地选择在APRIORI算法中组合协作词的置信度,支持和单词的数量。我们使用了建议的功能 - 矢量文档表示使用了朴素的贝叶斯分类器在文本数据上。通过对文档进行分类,我们已经证明了关联词挖掘的特征选择方法比信息增益和文档频率更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号