Text Extraction from Images in the Wild Using the Viola-Jones Algorithm

机译：使用Viola-Jones算法从野外图像中提取文本

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text Localization and extraction is an important issue in modern applications of computer vision. Applications such as reading and translating texts in the wild or from videos are among the many applications that can benefit results of this field. In this work, we adopt the well-known Viola-Jones algorithm to enable text extraction and localization from images in the wild. The Viola-Jones is an efficient, and a fast image-processing algorithm originally used for face detection. Based on some resemblance between text and face detection tasks in the wild, we have modified the viola-jones to detect regions of interest where text may be localized. In the proposed approach, some modification to the H AAR like features and a semi-automatic process of data set generating and manipulation were presented to train the algorithm. A process of sliding windows with different sizes have been used to scan the image for individual letters and letter clusters existence. A post processing step is used in order to combine the detected letters into words and to remove false positives. The novelty of the presented approach is using the strengths of a modified Viola-Jones algorithm to identify many different objects representing different letters and clusters of similar letters and later combine them into words of varying lengths. Impressive results were obtained on the ICDAR contest data sets.

机译：在计算机视觉的现代应用中，文本本地化和提取是一个重要的问题。诸如从野外或从视频中阅读和翻译文本之类的应用程序都是可以使该领域的结果受益的众多应用程序之一。在这项工作中，我们采用了著名的Viola-Jones算法，可以从野外图像中提取文本并进行本地化。 Viola-Jones是一种高效且快速的图像处理算法，最初用于面部检测。基于文本和野外面部检测任务之间的某些相似之处，我们对中提琴琼斯进行了修改，以检测可能定位了文本的感兴趣区域。在提出的方法中，提出了对H AAR类特征的一些修改以及数据集生成和处理的半自动过程，以训练该算法。已使用具有不同大小的滑动窗口的过程来扫描图像以查找单个字母和字母簇的存在。使用后处理步骤是为了将检测到的字母组合成单词并消除误报。所提出的方法的新颖性是利用改进的Viola-Jones算法的优势来识别代表不同字母和相似字母簇的许多不同对象，然后将它们组合成不同长度的单词。在ICDAR竞赛数据集上获得了令人印象深刻的结果。

著录项

来源
《International conference on graphic and image processing》|2017年|106151H.1-106151H.10|共10页
会议地点
作者
Raid M. Saabni; Eran Zingboim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Text Localization; Viola-Jones; Ada-Boost; HAAR like Features;

机译：文字本地化;中提琴-琼斯Ada-Boost;类似于HAAR的功能;

相似文献

外文文献
中文文献
专利

1. Review of Text Extraction Algorithms for Scene-text and Document Images [J] . Sahare Parul, Dhok Sanjay B. IETE Technical Review . 2017,第2期

机译：场景文本和文档图像的文本提取算法综述
2. Eye Feature Extraction with Calibration Model using Viola-Jones and Neural Network Algorithms [J] . Farah Nadia Ibrahim, Zalhan Mohd Zin Norazlin Ibrahim Advances in Science, Technology and Engineering Systems . 2019,第6期

机译：使用VITEA-JONES和神经网络算法用校准模型提取眼特征提取
3. An Application of an EM Algorithm for Skew Detection of Signatures in Text Images: Signature Extraction From Images [J] . International journal of computer vision and iImage processing . 2019,第4期

机译：EM算法在文本图像签名偏斜检测中的应用：从图像中提取签名
4. Text Extraction from Images in the Wild Using the Viola-Jones Algorithm [C] . Raid M. Saabni, Eran Zingboim International Conference on Graphic and Image Processing . 2017

机译：使用中提琴jones算法从野外图像中提取图片
5. Data extraction from text using wild card queries. [D] . Li, Haobin. 2006

机译：使用通配符查询从文本中提取数据。
6. Validation of the Total Visual Acuity Extraction Algorithm (TOVA) for Automated Extraction of Visual Acuity Data From Free Text Unstructured Clinical Records [O] . Douglas M. Baughman, Grace L. Su, Irena Tsui, -1

机译：从自由文本非结构化临床记录中自动提取视敏度数据的总视敏度提取算法（TOVA）的验证
7. Eye Feature Extraction with Calibration Model using Viola-Jones and Neural Network Algorithms [O] . Farah Nadia Ibrahim, Zalhan Mohd Zin, Norazlin Ibrahim 2019

机译：使用VITEA-JONES和神经网络算法用校准模型提取眼特征提取

Text Extraction from Images in the Wild Using the Viola-Jones Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅