首页> 外文期刊>Malaysian Journal of Computer Science >Adaptive Histogram Analysis for Scene Text Binarization and Recognition
【24h】

Adaptive Histogram Analysis for Scene Text Binarization and Recognition

机译:场景文本二值化和识别的自适应直方图分析

获取原文
           

摘要

Scene text binarization and recognition is a challenging task due to different appearance of text in clutter background and uneven illumination in natural scene images. In this paper, we present a new method based on adaptive histogram analysis for each sliding window over a word of a text line detected by the text detection method. The histogram analysis works on the basis that intensity values of text pixels in each sliding window have uniform color. The method segments the words based on region growing which studies spacing between words and characters. Then we propose to use existing OCRs such as ABBYY and Tesseract (Google) to recognize the text line at word and character levels to validate the binarization results. The method is compared with well-known global thresholding technique of binarization to show its effectiveness.
机译:场景文本二值化和识别是一项具有挑战性的任务,这是因为在杂乱背景中文本的外观不同以及自然场景图像中的照明不均匀。在本文中,我们针对文本检测方法检测到的文本行单词上的每个滑动窗口,提出了一种基于自适应直方图分析的新方法。直方图分析是基于每个滑动窗口中文本像素的强度值具有统一的颜色而进行的。该方法基于研究单词和字符之间的间距的区域增长对单词进行分割。然后,我们建议使用现有的OCR(例如ABBYY和Tesseract(Google))在单词和字符级别识别文本行,以验证二值化结果。将该方法与著名的二值化全局阈值技术进行比较,以证明其有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号