2012 IEEE Student Conference on Research and Development

Optical character recognition of Arabic printed text


Abstract

Optical character recognition (OCR) systems improve human-machine interaction. They are widely used in areas such as editing and storing previously printed or handwritten documents. Much research has been done on the identification of Latin, Japanese and Chinese characters, but very little on Arabic recognition. The likely reasons are the limited IT activity in Arabic-speaking countries and the difficulty and complexity of identifying Arabic characters compared to the others. The cursive nature of Arabic text introduces further difficulties. In this paper, a technique is employed to segment printed Arabic text in order to separate the Arabic characters, and powerful features are then extracted from each character to be recognized. To recognize the characters, those features are compared with the fields of a pre-prepared database. Although the database was prepared from characters written in the Times New Roman font, experimental results show relatively high accuracy for the developed method when it is tested on several sizes of several fonts besides Times New Roman.
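The abstract does not say which segmentation technique or which features the authors use. Purely as an illustration of the general pipeline it describes (segment, extract features, compare against stored reference entries), the following minimal sketch assumes a column projection profile for segmentation, zoning ink density as the feature, and a nearest-neighbour lookup. All function names, the `threshold` and `zones` parameters, and the template dictionary are hypothetical and are not taken from the paper.

```python
from math import dist

# Illustrative sketch only: the paper's actual segmentation method and
# features are not given in the abstract. Everything below is an assumption.

def column_profile(image):
    """Count ink pixels (value 1) in each column of a binary image."""
    return [sum(col) for col in zip(*image)]

def split_columns(profile, threshold=0):
    """Return (start, end) column ranges whose ink count exceeds threshold.

    With threshold=0 only blank columns act as cut points. For cursive
    text a small positive threshold is a simple heuristic for cutting at
    thin connecting strokes (an assumption, not the authors' technique).
    """
    segments, start = [], None
    for x, ink in enumerate(profile):
        if ink > threshold and start is None:
            start = x
        elif ink <= threshold and start is not None:
            segments.append((start, x))
            start = None
    if start is not None:
        segments.append((start, len(profile)))
    return segments

def zone_features(image, x0, x1, zones=2):
    """Ink density in a zones x zones grid over columns [x0, x1)."""
    height, width = len(image), x1 - x0
    features = []
    for r in range(zones):
        r0, r1 = r * height // zones, (r + 1) * height // zones
        for c in range(zones):
            c0 = x0 + c * width // zones
            c1 = x0 + (c + 1) * width // zones
            area = (r1 - r0) * (c1 - c0)
            ink = sum(image[y][x] for y in range(r0, r1) for x in range(c0, c1))
            features.append(ink / area if area else 0.0)
    return features

def classify(features, templates):
    """Return the label of the stored feature vector closest to features."""
    return min(templates, key=lambda label: dist(features, templates[label]))

# Toy binary image: two ink runs separated by blank columns.
image = [
    [1, 1, 0, 0, 1],
    [1, 1, 0, 0, 1],
]
segments = split_columns(column_profile(image))
# Hypothetical reference "database" of feature vectors.
templates = {"a": [1.0, 1.0, 1.0, 1.0], "b": [0.0, 1.0, 0.0, 1.0]}
labels = [classify(zone_features(image, x0, x1), templates) for x0, x1 in segments]
```

Matching against vectors built from a single reference font mirrors the abstract's setup in spirit: density-style features are normalised by area, so they depend less on glyph size than raw pixel counts do.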
