Conference: 9th International Conference on Language Resources and Evaluation

Finite-state morphological transducers for three Kypchak languages

Abstract

This paper describes the development of free/open-source finite-state morphological transducers for three Turkic languages (Kazakh, Tatar, and Kumyk), representing one language from each of the three commonly distinguished sub-branches of the Kypchak branch of Turkic. The finite-state toolkit used for the work is the Helsinki Finite-State Toolkit (HFST). The paper also describes how the transducer for each subsequent closely related language took less time to develop. An evaluation is presented which shows that the transducers all have reasonable coverage (around 90%) on freely available corpora of the languages, and high precision over a manually verified test set.
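
The coverage figure quoted above can be illustrated with a minimal sketch. It assumes that coverage means the share of corpus tokens for which the analyser returns at least one analysis; the abstract does not spell out the definition, so this is an assumption. The function names, the toy lexicon, and the tag strings are hypothetical stand-ins for a compiled transducer and are not taken from the paper.

```python
from typing import Callable, Iterable, List


def naive_coverage(tokens: Iterable[str],
                   analyse: Callable[[str], List[str]]) -> float:
    """Return the share of tokens that receive at least one analysis.

    Assumption: this is what "coverage" refers to; the abstract itself
    does not define the measure.
    """
    total = 0
    covered = 0
    for token in tokens:
        total += 1
        if analyse(token):  # a non-empty list means the token was analysed
            covered += 1
    return covered / total if total else 0.0


# Hypothetical toy lexicon standing in for a real transducer lookup.
# The tag strings are illustrative only.
TOY_LEXICON = {
    "үй": ["үй<n><nom>"],
    "үйлер": ["үй<n><pl><nom>"],
}


def toy_analyse(token: str) -> List[str]:
    """Look the token up in the toy lexicon; unknown tokens get no analysis."""
    return TOY_LEXICON.get(token, [])


corpus = ["үй", "үйлер", "кітап", "үй"]
coverage = naive_coverage(corpus, toy_analyse)
print(f"{coverage:.0%}")  # 3 of 4 tokens are analysed, so this prints 75%
```

In practice the `analyse` callable would wrap a lookup against the compiled transducer rather than a dictionary. Note that a measure of this kind only counts whether some analysis is returned, not whether it is correct, which is presumably why precision is reported separately over a manually verified test set.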
