Journal: The Quarterly Journal of Experimental Psychology (QJEP)

Estimating the prevalence and diversity of words in written language

Abstract

Recently, a new crowd-sourced language metric, word prevalence, has been introduced; it estimates the proportion of the population that knows a given word. This measure has been shown to account for unique variance in large sets of lexical performance data. This article builds on the work of Brysbaert et al. and Keuleers et al. by introducing new corpus-based metrics that estimate how likely a word is to be an active member of the natural language environment, and hence to be known by a larger subset of the general population. The metrics are derived from an analysis of a newly collected corpus of over 25,000 fiction and non-fiction books, and are shown to account for significantly more variance than past corpus-based measures.
