...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >A Bayesian Classification Approach Using Class-Specific Features for Text Categorization
【24h】

A Bayesian Classification Approach Using Class-Specific Features for Text Categorization

机译:使用特定于类的特征进行文本分类的贝叶斯分类方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we present a Bayesian classification approach for automatic text categorization using class-specific features. Unlike conventional text categorization approaches, our proposed method selects a specific feature subset for each class. To apply these class-specific features for classification, we follow Baggenstoss's PDF Projection Theorem (PPT) to reconstruct the PDFs in raw data space from the class-specific PDFs in low-dimensional feature subspace, and build a Bayesian classification rule. One noticeable significance of our approach is that most feature selection criteria, such as Information Gain (IG) and Maximum Discrimination (MD), can be easily incorporated into our approach. We evaluate our method's classification performance on several real-world benchmarks, compared with the state-of-the-art feature selection approaches. The superior results demonstrate the effectiveness of the proposed approach and further indicate its wide potential applications in data mining.
机译:在本文中,我们提出了一种使用特定于类的功能进行自动文本分类的贝叶斯分类方法。与传统的文本分类方法不同,我们提出的方法为每个类选择一个特定的特征子集。为了将这些特定于类别的特征应用于分类,我们遵循Baggenstoss的PDF投影定理(PPT)从低维特征子空间中特定于类别的PDF重建原始数据空间中的PDF,并建立贝叶斯分类规则。我们的方法的一个显着意义是,大多数功能选择标准,例如信息增益(IG)和最大歧视(MD),都可以轻松地纳入我们的方法中。与最先进的特征选择方法相比,我们在多个实际基准上评估了该方法的分类性能。优异的结果证明了该方法的有效性,并进一步表明了其在数据挖掘中的广泛应用潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号