Web Structure Mining Based on Maximum Flow and Page Similarity

Li Ying (李莹); Wu Xiaojun (吴晓军)

Abstract

Web structure mining algorithms are prone to "topic drift" and to multiple mutually reinforcing relationships between hosts. To address both problems, this paper proposes a hyperlink structure mining method based on maximum flow and page similarity values. Building on the traditional hyperlink structure mining algorithm HITS, the method uses page similarity values to construct the adjacency matrix and combines this with maximum-flow-based Web community discovery to build a feature vector space model. Iterative computation then yields the highest-value authority and hub result sets. Experimental results show that the method achieves good precision and recall, effectively suppresses topic drift, and has practical value.
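The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of a similarity-weighted HITS iteration. It assumes a bag-of-words cosine similarity as the page similarity value and uses it as the weight of each hyperlink in the adjacency matrix; both choices are illustrative assumptions, and the names `cosine_similarity` and `weighted_hits` are hypothetical. The maximum-flow community discovery step is not shown.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity of two bag-of-words term-frequency vectors.

    Assumption: the paper's actual similarity measure is not stated in the
    abstract; cosine similarity is used here only for illustration.
    """
    ta = Counter(text_a.lower().split())
    tb = Counter(text_b.lower().split())
    dot = sum(ta[t] * tb[t] for t in ta.keys() & tb.keys())
    norm_a = math.sqrt(sum(v * v for v in ta.values()))
    norm_b = math.sqrt(sum(v * v for v in tb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def _normalize(scores):
    """Scale scores to unit L2 norm; leave an all-zero vector unchanged."""
    norm = math.sqrt(sum(v * v for v in scores.values()))
    return {k: v / norm for k, v in scores.items()} if norm else scores


def weighted_hits(links, texts, iterations=50):
    """HITS where each link p -> q is weighted by the similarity of p and q.

    links: iterable of (source, target) page ids
    texts: dict mapping page id -> page text
    Returns (authority, hub) score dicts.
    """
    pages = sorted(texts)
    # Weighted adjacency "matrix" stored sparsely as {(p, q): weight}.
    weights = {(p, q): cosine_similarity(texts[p], texts[q]) for p, q in links}
    auth = {p: 1.0 for p in pages}
    hub = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # Authority update: sum of weighted hub scores of in-linking pages.
        new_auth = {p: 0.0 for p in pages}
        for (p, q), w in weights.items():
            new_auth[q] += w * hub[p]
        # Hub update: sum of weighted authority scores of linked-to pages.
        new_hub = {p: 0.0 for p in pages}
        for (p, q), w in weights.items():
            new_hub[p] += w * new_auth[q]
        auth, hub = _normalize(new_auth), _normalize(new_hub)
    return auth, hub


if __name__ == "__main__":
    texts = {
        "a": "web mining hits",
        "b": "web mining link analysis",
        "c": "web structure mining",
        "d": "cooking recipes pasta",
    }
    links = [("a", "c"), ("b", "c"), ("a", "d")]
    authority, hubs = weighted_hits(links, texts)
    print(sorted(authority.items(), key=lambda kv: -kv[1]))
```

In this toy graph the link from page `a` to the off-topic page `d` receives zero weight, so `d` gains no authority. That is the intuition behind using page similarity to dampen topic drift, under the assumptions stated above.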

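Bibliographic record

Source
《计算机技术与发展》 (Computer Technology and Development) | 2011, No. 10 | pp. 112-115 | 4 pages
Authors
Li Ying (李莹); Wu Xiaojun (吴晓军)
Affiliation
School of Computer Science, Shaanxi Normal University, Xi'an, Shaanxi 710062
Original format: PDF
Language of text: Chinese (chi)
CLC classification: Algorithm theory
Keywords
Web structure mining; topic drift; page similarity value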
