首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A language model using variable length tokens for open-vocabulary Hangul text recognition
【24h】

A language model using variable length tokens for open-vocabulary Hangul text recognition

机译:使用可变长度标记进行开放式韩文文本识别的语言模型

获取原文
获取原文并翻译 | 示例
       

摘要

We propose a novel language model for Hangul text recognition. Without relying on prior linguistic knowledge in training, the proposed model learns variable length Hangul character sequences, which comprise the elementary tokens of Korean language, and their probabilities from statistics of a raw text corpus. Experiments in handwritten Hangul recognition shows that the proposed language model is effective in postprocessing of recognition results. (C) 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
机译:我们提出了一种用于韩文文本识别的新颖语言模型。所提出的模型在不依赖训练之前的语言知识的情况下,学习了可变长度的韩文字符序列,该序列包含韩语的基本标记及其从原始文本语料库的统计中得出的概率。手写韩文识别实验表明,所提出的语言模型对识别结果进行后处理是有效的。 (C)2003模式识别学会。由Elsevier Ltd.出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号