Journal: Physica A: Statistical Mechanics and its Applications

Entropy analysis of natural language written texts

Abstract

The aim of the present work is to investigate the relative contribution of ordered and stochastic components in natural written texts and to examine how text category and language influence them. To this end, a binary representation of written texts is used, and the generated symbolic sequences are examined by standard block entropy analysis to obtain the Shannon and Kolmogorov entropies. Both entropies are found to be sensitive to language and to text category, with the text category sensitivity following almost the same trends in the two languages considered (English and Greek). The values of these entropies are compared with those of stochastically generated symbolic sequences, and the nature of the correlations present in this representation of real written texts is identified.
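
As a rough illustration of what block entropy analysis of a binary symbolic sequence can look like, here is a minimal Python sketch. It is not the authors' implementation, and several choices are assumptions the abstract does not specify: blocks are read with a gliding (overlapping) window, the block entropy is H(n) = -Σ p log2 p over the observed length-n blocks, and the entropy rate is approximated by the difference h(n) = H(n+1) - H(n), whose large-n limit is often what is meant by the Kolmogorov entropy in this setting. The helper `text_to_bits` is hypothetical: it uses the 8-bit UTF-8 encoding as a stand-in for whatever binary representation the paper actually uses.

```python
from collections import Counter
from math import log2


def block_entropy(symbols: str, n: int) -> float:
    """Shannon entropy (in bits) of length-n blocks, read with a gliding window.

    Assumption: overlapping windows; the paper may read blocks differently.
    """
    total = len(symbols) - n + 1
    if n < 1 or total < 1:
        raise ValueError("block length must be between 1 and len(symbols)")
    counts = Counter(symbols[i:i + n] for i in range(total))
    return -sum((c / total) * log2(c / total) for c in counts.values())


def conditional_entropy(symbols: str, n: int) -> float:
    """h(n) = H(n+1) - H(n), a finite-n estimate of the entropy rate."""
    return block_entropy(symbols, n + 1) - block_entropy(symbols, n)


def text_to_bits(text: str) -> str:
    """Hypothetical binary representation: concatenated 8-bit UTF-8 bytes."""
    return "".join(f"{byte:08b}" for byte in text.encode("utf-8"))


if __name__ == "__main__":
    bits = text_to_bits("entropy analysis of natural language written texts")
    for n in range(1, 5):
        print(n, block_entropy(bits, n), conditional_entropy(bits, n))
```

For a strictly periodic sequence such as `"01" * 50`, h(n) drops close to zero once n covers the period, whereas an uncorrelated, equiprobable random binary sequence keeps h(n) near 1 bit per symbol. Comparisons of this kind against stochastically generated sequences are what the abstract describes. Estimates for large n need long sequences, since the number of possible blocks grows as 2^n.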