
Identifying language of origin for words using estimates of normalized appearance frequency


Abstract

The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.
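The embodiment described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented method: the toy counts, the corpus sizes, the choice of normalization (count divided by corpus size, then rescaled so the features sum to 1), and the identity-weight scorer are all assumptions made for the example. A real system would presumably estimate frequencies from much larger sources and use a trained scorer.

```python
from typing import Dict, List, Optional

# Hypothetical toy data: how often each word was seen in a corpus of each
# language, and the total token count of that corpus.
COUNTS: Dict[str, Dict[str, int]] = {
    "en": {"schmidt": 20, "tanaka": 5},
    "de": {"schmidt": 300, "tanaka": 1},
    "ja": {"schmidt": 2, "tanaka": 400},
}
CORPUS_SIZES: Dict[str, int] = {"en": 1_000_000, "de": 500_000, "ja": 800_000}
LANGUAGES: List[str] = sorted(CORPUS_SIZES)  # fixed feature order


def normalized_frequencies(word: str) -> List[float]:
    """Build the feature vector: one normalized frequency per language.

    Assumed normalization: relative frequency (count / corpus size),
    rescaled so that the features sum to 1 across languages.
    """
    rel = [COUNTS[lang].get(word, 0) / CORPUS_SIZES[lang] for lang in LANGUAGES]
    total = sum(rel)
    return [r / total if total else 0.0 for r in rel]


# Hypothetical scorer: one weight vector per candidate origin language.
# Identity weights are a placeholder for weights learned from labeled data.
WEIGHTS: Dict[str, List[float]] = {
    origin: [1.0 if lang == origin else 0.0 for lang in LANGUAGES]
    for origin in LANGUAGES
}


def predict_origin(word: str) -> Optional[str]:
    """Score the feature vector and return the best-scoring origin language."""
    features = normalized_frequencies(word)
    if not any(features):
        return None  # word unseen in every corpus: no evidence to score
    scores = {
        origin: sum(w * f for w, f in zip(weights, features))
        for origin, weights in WEIGHTS.items()
    }
    return max(scores, key=scores.get)
```

With these toy numbers, `predict_origin("schmidt")` picks `"de"` and `predict_origin("tanaka")` picks `"ja"`, because each word's relative frequency is highest in that language's corpus. Dividing by corpus size keeps a large corpus from dominating the features merely because it contains more text.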
