Journal: Chinese Science Bulletin

Small-world patterns in Chinese phrase networks



Abstract

Recently, the structure and function of complex networks have become one of the hottest topics in statistical physics and interdisciplinary science. Studies have shown that real networks containing a huge number of nodes do not form and evolve randomly, as might be expected, but display peculiar features. The most surprising is the small-world effect, which is shared by food webs, the web of human sexual contacts, word networks, and others. Moreover, scale-free degree distributions also emerge in the Internet and in protein networks. Studies of English words demonstrate that English Word Networks (EWN) exhibit a small-world effect, with any two word nodes separated by about three degrees; in other words, a target word can be reached in only three search steps on average. Chinese is a widely used language, second only to English. The appearance and development of modern means of communication, such as computers, mobile phones, and the Internet, call for better techniques in Chinese word processing. One of the most important steps in word processing is character searching, whose speed directly affects efficiency, and efficient searching requires proper storage of word information. Unlike an English word, a Chinese character is shaped like a square picture, which makes its storage and search rather difficult. Accordingly, it has been believed since the 1950s that Chinese word processing is harder than English word processing. However, this paper reveals that EWN and Chinese Phrase Networks (CPN) show striking similarities in their emergent features. Chinese phrases could be searched at the same rate as English words if stored and processed properly.
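The two quantities usually used to diagnose a small-world effect, the average shortest path length and the clustering coefficient, can be illustrated with a minimal Python sketch. The abstract does not say how the phrase network is built, so the construction here (characters as nodes, an edge between characters that co-occur in a phrase) is purely an assumption for illustration, as are the function names and the toy phrase list.

```python
from collections import deque
from itertools import combinations


def build_graph(phrases):
    """Assumed construction: link every pair of distinct characters in a phrase."""
    graph = {}
    for phrase in phrases:
        for ch in phrase:
            graph.setdefault(ch, set())
        for a, b in combinations(set(phrase), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def bfs_distances(graph, source):
    """Shortest hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def average_path_length(graph):
    """Mean shortest path length over all ordered pairs of connected nodes."""
    total, pairs = 0, 0
    for source in graph:
        for target, d in bfs_distances(graph, source).items():
            if target != source:
                total += d
                pairs += 1
    return total / pairs if pairs else 0.0


def clustering_coefficient(graph):
    """Mean local clustering; nodes with fewer than two neighbors count as 0."""
    coeffs = []
    for neighbors in graph.values():
        k = len(neighbors)
        if k < 2:
            coeffs.append(0.0)
            continue
        links = sum(1 for a, b in combinations(neighbors, 2) if b in graph[a])
        coeffs.append(2 * links / (k * (k - 1)))
    return sum(coeffs) / len(coeffs) if coeffs else 0.0


if __name__ == "__main__":
    # Hypothetical toy phrase list, not data from the paper.
    toy_phrases = ["中国", "国家", "家人", "人民", "中文", "文学", "学生"]
    g = build_graph(toy_phrases)
    print("average path length:", average_path_length(g))
    print("clustering coefficient:", clustering_coefficient(g))
```

A network is typically called small-world when its average path length stays close to that of a random graph of the same size while its clustering coefficient is much higher; a toy list this small only exercises the functions and cannot show that effect.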
