
Evolution of document networks



Abstract

How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.
