Published in: International Conference on Computational Science and Technology
The Development of an Integrated Corpus for Malay Language

Abstract

Generally, a corpus serves as a source of data for many kinds of research, and several Malay corpora have been developed to meet researchers' needs. However, these corpora are scattered and not integrated, so words present in one corpus may be missing from another. This paper focuses on developing an integrated corpus that combines the four most comprehensive Malay corpora. The intention is to provide comprehensive coverage of Malay text that would benefit any relevant work.