Journal: Web Intelligence and Agent Systems

Application of rough ensemble classifier to web services categorization and focused crawling

Abstract

This paper discusses applications of the rough ensemble classifier [27] to two emerging problems in web mining: the categorization of web services and topic-specific web crawling. Both applications consist of two major steps:
(1) splitting the feature space according to the internal tag structure of web services and hypertext, so that it can be represented in a tensor space model, and
(2) combining the classifications obtained on the different tensor components using the rough ensemble classifier.
The first application is the classification of web services, where a two-step improvement over existing classification results is shown. In the first step, the tensor space model yields better classification results than existing approaches. In the second step, the rough-set-based ensemble classifier improves the results further. The second application is focused crawling using rough ensemble prediction. The experiments for this application show a better harvest rate and better target recall for focused crawling.
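
Step (2) can be illustrated with a minimal sketch. It is not the rough ensemble classifier of [27]; it assumes a much simplified stand-in in the spirit of a rough-set decision table, where the outputs of the per-component classifiers act as condition attributes and training items with identical outputs form one equivalence class. The function names, the three components (name, documentation, operations), and the toy labels are all hypothetical.

```python
from collections import Counter, defaultdict


def build_decision_table(base_predictions, labels):
    """Group training items by their tuple of base-classifier outputs.

    Each distinct tuple is treated as one equivalence class (items the
    base classifiers cannot tell apart); we store the true-label counts
    observed inside that class.
    """
    table = defaultdict(Counter)
    for preds, label in zip(base_predictions, labels):
        table[tuple(preds)][label] += 1
    return table


def combine(table, preds):
    """Predict from the decision table; fall back to a majority vote
    when this combination of base outputs was never seen in training."""
    counts = table.get(tuple(preds))
    if counts:
        return counts.most_common(1)[0][0]
    return Counter(preds).most_common(1)[0][0]


# Hypothetical outputs of three per-component classifiers
# (name, documentation, operations) on four training services.
train_preds = [
    ("travel", "travel", "finance"),
    ("travel", "travel", "finance"),
    ("travel", "finance", "finance"),
    ("finance", "finance", "finance"),
]
train_labels = ["travel", "travel", "travel", "finance"]

table = build_decision_table(train_preds, train_labels)

# Seen combination: the table overrides the plain majority vote.
seen = combine(table, ("travel", "finance", "finance"))
# Unseen combination: falls back to the majority vote.
unseen = combine(table, ("finance", "travel", "travel"))
print(seen, unseen)
```

In this toy data the table has learned that a "travel" output from the name component is reliable even when the other two components disagree, which a plain majority vote would miss. A real rough-set combiner would additionally reduce the condition attributes and derive rules rather than look up raw tuples.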
