首页> 外文会议>9th International conference on language resources and evaluation >A set of open-source tools for Turkish natural language processing
【24h】

A set of open-source tools for Turkish natural language processing

机译:一套用于土耳其自然语言处理的开源工具

获取原文

摘要

This paper introduces a set of freely available, open-source tools for Turkish that are built around TRmorph, a morphological analyzer introduced earlier in Coeltekin (2010a). The article first provides an update on the analyzer, which includes a complete rewrite using a different finite-state description language and tool set as well as major tagset changes to comply better with the state-of-the-art computational processing of Turkish and the user requests received so far. Besides these major changes to the analyzer, this paper introduces tools for morphological segmentation, stemming and lemmatization, guessing unknown words, grapheme to phoneme conversion, hyphenation and a morphological disambiguation.
机译:本文介绍了一套可自由的,用于土耳其语的可自由开源工具,该工具围绕Trmorph建造的土耳其,这是在Coeltekin(2010A)之前介绍的形态分析仪。本文首先在分析仪上提供更新,该分析器包括使用不同的有限状态描述语言和工具集的完整重写,以及主要的TAGSET更改,以更好地通过土耳其语的最先进的计算处理更好地遵守到目前为止收到的用户请求。除了对分析仪的这些重大变化外,本文介绍了形态分割,肿胀和鼠尾化的工具,猜测未知的单词,图形转换转换,连字符和形态歧义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号