首页> 外文期刊>Literary & linguistic computing >Latin word stemming using Wiktionary
【24h】

Latin word stemming using Wiktionary

机译:使用维基词典词干的拉丁词

获取原文
获取原文并翻译 | 示例
           

摘要

This article demonstrates how to automatically build a Latin word stemmer to transform words into their grammatical roots. By using the Wiktionary database as source data, it becomes possible to build such a tool with several hundreds of thousands of words. Our experiments demonstrate that it can then be used to correctly find the root of 78% of the words of Martial's Epigrams, and can be combined with other linguistic tools such as the Latin WordNet to greatly enhance their language coverage. While our research focuses on the Latin language, the same methodology could be used to build stemmers and other linguistic tools for many other ancient languages represented in Wiktionary, such as Ancient Greek or Old Armenian.
机译:本文演示了如何自动构建拉丁语词干分析器,以将单词转换为其语法根源。通过将Wiktionary数据库用作源数据,可以构建具有数十万个单词的工具。我们的实验表明,它可以用来正确地找到武术的Epigrams的78%单词的词根,并且可以与其他语言工具(例如拉丁语WordNet)结合使用,以大大增强其语言覆盖范围。虽然我们的研究集中在拉丁语言上,但可以使用相同的方法为Wiktionary中代表的许多其他古代语言(例如古希腊语或亚美尼亚语)构建词干分析器和其他语言工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号