Conference: International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2006); February 19-25, 2006; Mexico City (MX)

Towards the Automatic Lemmatization of 16th Century Mexican Spanish: A Stemming Scheme for the CHEM



Abstract

Two problems arise when developing a stemming scheme for diachronic corpora: (1) the morphological systems of natural languages may change over time, and these changes are usually not sufficiently documented; and (2) such corpora exhibit highly diverse orthographic characteristics. This short paper briefly describes a stemming strategy for a diachronic corpus of Mexican Spanish that partially addresses these problems. The success rates of the method are compared with those of a Porter stemmer.
