首页> 外文会议>2012 Third International Conference on Computing Communication amp; Networking Technologies. >Text line extraction from handwritten document pages based on line contour estimation
【24h】

Text line extraction from handwritten document pages based on line contour estimation

机译:基于线条轮廓估计的手写文档页面文本行提取

获取原文
获取原文并翻译 | 示例

摘要

Extraction of text lines from handwritten/printed document images is one of the important steps in the process of an Optical Character Recognition (OCR) system. In case of handwritten document images, presence of skewed, touching or overlapping text line(s) makes this process a real challenge to the researcher. In the present work, a new text line extraction technique based on line contour estimation is reported. Here, digitized document image is initially partitioned into a number of vertical fragments of equal width. Then all the line segments present in these vertical fragments are detected. Finally, the neighboring line segments are analyzed to place them inside the line boundary in which they actually belong. For experimental purpose, the developed technique is tested on CMATERdb1.2.1 database and present technique extracts 88.44% text lines successfully.
机译:从手写/打印的文档图像中提取文本行是光学字符识别(OCR)系统过程中的重要步骤之一。对于手写文档图像,歪斜,触摸或重叠的文本行的存在使该过程成为研究人员的真正挑战。在当前的工作中,报告了一种新的基于行轮廓估计的文本行提取技术。在这里,数字化文档图像最初被划分为多个等宽的垂直片段。然后检测这些垂直片段中存在的所有线段。最后,分析相邻的线段以将其放置在它们实际所属的线边界内。出于实验目的,在CMATERdb1.2.1数据库上测试了开发的技术,该技术成功提取了88.44%的文本行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号