Conference: 2018 Recent Advances on Engineering, Technology and Computational Sciences

A Novel Segmentation Technique for Urdu Type-Written Text

Abstract

Text segmentation is the process of subdividing a text image into its constituent parts, such as text lines, words and isolated characters. It is the first module in the design of optical character recognition (OCR) systems. Automatic text segmentation is becoming an increasingly important problem. The major difficulties arise from the lack of a standard dataset, the wide diversity of objectives and the lack of meaningful quantitative evaluation. In this paper, a new technique is proposed that segments Urdu type-written text into text lines on the basis of the edge information of connected components. The technique is tested on the benchmark dataset using the precision and recall metrics, achieving 87.36% and 84.75% respectively. Dataset collection, compilation and organization are also part of this research.
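The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes each connected component has already been reduced to a bounding box in a hypothetical `(left, top, right, bottom)` format, groups boxes into text lines when their top and bottom edges overlap vertically, and shows how precision and recall could be computed from counts of correctly and incorrectly detected lines. The names `group_into_lines` and `precision_recall` are invented for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical box format (an assumption): (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def group_into_lines(boxes: List[Box]) -> List[List[Box]]:
    """Group component boxes into text lines by vertical overlap.

    A simple stand-in heuristic: a box joins the first existing line whose
    vertical extent (top edge to bottom edge) overlaps its own.
    """
    lines: List[Dict] = []
    # Process boxes from the top of the page downwards.
    for box in sorted(boxes, key=lambda b: b[1]):
        _, top, _, bottom = box
        for line in lines:
            # Positive value means the two vertical extents overlap.
            overlap = min(bottom, line["bottom"]) - max(top, line["top"])
            if overlap > 0:
                line["boxes"].append(box)
                # Widen the line's extent to cover the new box.
                line["top"] = min(line["top"], top)
                line["bottom"] = max(line["bottom"], bottom)
                break
        else:
            # No overlapping line found: start a new one.
            lines.append({"top": top, "bottom": bottom, "boxes": [box]})
    return [line["boxes"] for line in lines]


def precision_recall(tp: int, fp: int, fn: int) -> Tuple[float, float]:
    """Return (precision, recall) from detection counts.

    tp: detected lines that match a ground-truth line
    fp: detected lines with no ground-truth match
    fn: ground-truth lines that were missed
    """
    detected = tp + fp
    actual = tp + fn
    precision = tp / detected if detected else 0.0
    recall = tp / actual if actual else 0.0
    return precision, recall


if __name__ == "__main__":
    sample = [(0, 0, 10, 10), (12, 2, 20, 9), (0, 20, 10, 30), (15, 22, 25, 31)]
    print(len(group_into_lines(sample)), "lines found")
    print(precision_recall(tp=8, fp=2, fn=4))
```

In this simple scheme, small marks such as dots and diacritics that sit between lines may not overlap any line and would need additional handling. How a detected line is matched to a ground-truth line is left open here, since the abstract does not state the matching criterion.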
