首页> 外文期刊>Expert systems with applications >The Chinese text categorization system with association rule and category priority
【24h】

The Chinese text categorization system with association rule and category priority

机译:具有关联规则和类别优先级的中文文本分类系统

获取原文
获取原文并翻译 | 示例
           

摘要

The process of text categorization involves some understanding of the content of the documents and/or some previous knowledge of the categories. For the content of the documents, we use the filtering measure for feature selection in our Chinese text categorization system. We modify the formula of TFIDF to strengthen important keywords' weights and weaken unimportant keywords' weights. For the knowledge of the categories, we use association rules to improve the precision of text classification and use category priority to represent the relationship between two different categories. Consequently, the experimental results show that our method can effectively not only decrease noise text but also increase the ratio of precision and recall of text categorization.
机译:文本分类过程涉及对文档内容的一些理解和/或有关类别的某些先前知识。对于文档的内容,我们在中文文本分类系统中使用过滤度量进行特征选择。我们修改了TFIDF的公式,以增强重要关键字的权重并削弱不重要关键字的权重。对于类别的知识,我们使用关联规则来提高文本分类的精度,并使用类别优先级来表示两个不同类别之间的关系。因此,实验结果表明,该方法不仅可以有效减少噪声文本,而且可以提高文本分类的精度和召回率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号