A language model using variable length tokens for open-vocabulary Hangul text recognition

Ryu SH; Kim JH


Abstract

We propose a novel language model for Hangul text recognition. Without relying on prior linguistic knowledge in training, the proposed model learns variable-length Hangul character sequences, which serve as the elementary tokens of the Korean language, and their probabilities from the statistics of a raw text corpus. Experiments in handwritten Hangul recognition show that the proposed language model is effective in postprocessing recognition results. (C) 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
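
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a unigram model over character substrings of bounded length, relative-frequency estimates, and a dynamic-programming search for the best segmentation; the function names, `max_len`, and the `unk` floor probability are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def train_token_model(corpus, max_len=3):
    """Count every substring of length 1..max_len inside each
    whitespace-delimited word, then convert counts to relative frequencies."""
    counts = Counter()
    for line in corpus:
        for word in line.split():
            for i in range(len(word)):
                for n in range(1, max_len + 1):
                    if i + n <= len(word):
                        counts[word[i:i + n]] += 1
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}


def best_log_prob(text, probs, max_len=3, unk=1e-6):
    """Return the log-probability of the best segmentation of `text`
    into known tokens (dynamic programming over end positions).
    Unseen single characters get the floor probability `unk`, so any
    string receives a finite score (an assumed open-vocabulary fallback)."""
    best = [0.0] + [-math.inf] * len(text)
    for end in range(1, len(text) + 1):
        for n in range(1, min(max_len, end) + 1):
            tok = text[end - n:end]
            p = probs.get(tok, unk if n == 1 else 0.0)
            if p > 0.0:
                best[end] = max(best[end], best[end - n] + math.log(p))
    return best[-1]


def rescore(candidates, probs, max_len=3):
    """Pick the recognizer candidate with the highest language-model score."""
    return max(candidates, key=lambda c: best_log_prob(c, probs, max_len))


# Toy corpus and two hypothetical recognizer outputs for the same word image.
corpus = ["한국어 문자 인식", "한국어 언어 모델", "문자 인식 모델"]
probs = train_token_model(corpus)
print(rescore(["한국이", "한국어"], probs))
```

In this toy setup the candidate that matches a frequently seen token sequence wins over the one containing an unseen character. A real postprocessor would presumably combine such a score with the recognizer's own confidence rather than use the language model alone.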

Bibliographic details

Source: Pattern Recognition: The Journal of the Pattern Recognition Society, 2004, No. 7, 4 pages
Authors: Ryu SH; Kim JH
Original format: PDF
Language: English
CLC classification: Automation technology and equipment
Keywords: language model; character recognition; hangul recognition; open-vocabulary; word recognition; UNITS

Similar literature

1. A language model using variable length tokens for open-vocabulary Hangul text recognition [J]. Ryu SH, Kim JH. Pattern Recognition: The Journal of the Pattern Recognition Society. 2004, No. 7
2. Integration of N-gram Language Models in Multiple Classifier Systems for Offline Handwritten Text Line Recognition [J]. Roman Bertolami, Horst Bunke. International Journal of Pattern Recognition and Artificial Intelligence. 2008, No. 7
3. Unsupervised language model adaptation for handwritten Chinese text recognition [J]. Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu. Pattern Recognition: The Journal of the Pattern Recognition Society. 2014, No. 3
4. Improvement in Performance of Tamil Phoneme Recognition using Variable Length and Hybrid Language Models [C]. S. Saraswathi, T. V. Geetha. International Conference of Signal Processing, Communications and Networking. 2007
5. Exposing the Importance of Hidden Pronunciations in Hangul from the Listener's Perspective: An Investigation of Korean as a Foreign Language [D]. Gagnon, Steven Garrison. 2020
6. Morpheme Matching Based Text Tokenization for a Scarce Resourced Language [O]. Zobia Rehman, Waqas Anwar, Usama Ijaz Bajwa
7. Arabic text recognition of printed manuscripts. Efficient recognition of off-line printed Arabic text using Hidden Markov Models, Bigram Statistical Language Model, and post-processing. [O]. Al-Muhtaseb Husni Abdulghani. 2010
