首页> 外文会议>Database and Expert Systems Applications >Feature Selection Using Association Word Mining for Classification
【24h】

Feature Selection Using Association Word Mining for Classification

机译:使用关联词挖掘进行分类的特征选择

获取原文

摘要

In this paper, we propose effective feature selection method using association word mining. Documents are represented as association-word-vectors that include a few words instead of single words. The focus in this paper is the association rule in reduction of a high dimensional feature space. The accuracy and recall of document classification depend on the number of words for composing association words, confidence, and support at Apriori algorithm. We show how confidence, support, and the number of words for composing association words at Apriori algorithm are selected efficiently. We have used Naive Bayes classifier on text data using proposed feature-vector document representation. By experiment for categorizing documents, we have proved that feature selection method of association word mining is more efficient than information gain and document frequency.
机译:本文提出了一种利用关联词挖掘的有效特征选择方法。文档表示为关联词向量,其中包括几个词而不是单个词。本文的重点是减少高维特征空间的关联规则。文档分类的准确性和召回率取决于构成关联词的词数,置信度和Apriori算法的支持。我们展示了如何有效地选择Apriori算法中用于组成联想词的置信度,支持度和词数。我们已使用拟议的特征向量文档表示法对文本数据使用了朴素贝叶斯分类器。通过对文档进行分类的实验,我们证明了关联词挖掘的特征选择方法比信息获取和文档频率更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号