Journal: 《计算机应用研究》 (Application Research of Computers)

A Blog Clustering Method Using Joint Keywords Selected on the Blog Connect Platform

         

Abstract

Full-text keyword search is expensive in time, and tags or categories suffer from ambiguity and synonymy. To address these problems, this paper proposes selecting joint keywords (co-keywords) on the blog connect platform for blog clustering. The method assumes that the candidate keywords (or joint keywords) with which readers query a blog post can represent the topic of that post. To verify this assumption, tracking code is first embedded in the blog connect (BC) component to collect the keywords queried by readers. Next, FKRP is used to select suitable candidate keywords as joint keywords. Finally, four similarity measures (overlap projection, mutual information projection, distributed information, and the Kendall τ coefficient) are used to validate the joint keywords extracted by the BC component. Experimental results show that the proposed method gives searchers a fast path to the corresponding blog. In addition, the generated joint keywords reduce the complexity and redundancy of full-text keyword search and meet the needs of blog users well.
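The abstract does not define its similarity measures, so the following is only a minimal sketch of how two of them might be computed when comparing a joint-keyword list against a full-text keyword list. It assumes "overlap" means the standard overlap coefficient on keyword sets and that the Kendall τ coefficient is the tie-free tau-a computed over keywords present in both ranked lists. The function names and the sample keywords are hypothetical and do not come from the paper.

```python
from itertools import combinations


def overlap_coefficient(keywords_a, keywords_b):
    """Overlap coefficient |A ∩ B| / min(|A|, |B|) of two keyword collections.

    Assumption: this stands in for the paper's overlap measure,
    whose exact definition is not given in the abstract.
    """
    a, b = set(keywords_a), set(keywords_b)
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


def kendall_tau(rank_a, rank_b):
    """Kendall tau-a between two ranked keyword lists.

    Only keywords present in both lists are compared. Each list is
    assumed to contain no duplicate keywords, so there are no ties.
    """
    pos_b = {kw: i for i, kw in enumerate(rank_b)}
    shared = [kw for kw in rank_a if kw in pos_b]
    n = len(shared)
    if n < 2:
        return 0.0
    concordant = discordant = 0
    # combinations() yields pairs (x, y) where x precedes y in rank_a.
    for x, y in combinations(shared, 2):
        if pos_b[x] < pos_b[y]:
            concordant += 1
        else:
            discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical data: joint keywords from reader queries versus
# keywords ranked by a full-text method for the same post.
joint_keywords = ["python", "clustering", "blog"]
fulltext_keywords = ["blog", "python", "clustering", "search"]

overlap = overlap_coefficient(joint_keywords, fulltext_keywords)
tau = kendall_tau(joint_keywords, fulltext_keywords)
```

A high overlap with a low τ, as in this sample, would indicate that the two methods agree on which keywords matter but not on their order.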
