Real-Time Lexicon-Free Scene Text Localization and Recognition

Lukáš Neumann; Jiří Matas

首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Real-Time Lexicon-Free Scene Text Localization and Recognition

【24h】

Real-Time Lexicon-Free Scene Text Localization and Recognition

机译：实时无词典场景文本本地化和识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

An end-to-end real-time text localization and recognition method is presented. Its real-time performance is achieved by posing the character detection and segmentation problem as an efficient sequential selection from the set of Extremal Regions. The ER detector is robust against blur, low contrast and illumination, color and texture variation. In the first stage, the probability of each ER being a character is estimated using features calculated by a novel algorithm in constant time and only ERs with locally maximal probability are selected for the second stage, where the classification accuracy is improved using computationally more expensive features. A highly efficient clustering algorithm then groups ERs into text lines and an OCR classifier trained on synthetic fonts is exploited to label character regions. The most probable character sequence is selected in the last stage when the context of each character is known. The method was evaluated on three public datasets. On the ICDAR 2013 dataset the method achieves state-of-the-art results in text localization; on the more challenging SVT dataset, the proposed method significantly outperforms the state-of-the-art methods and demonstrates that the proposed pipeline can incorporate additional prior knowledge about the detected text. The proposed method was exploited as the baseline in the ICDAR 2015 Robust Reading competition, where it compares favourably to the state-of-the art.

机译：提出了一种端到端的实时文本定位与识别方法。它的实时性能是通过将字符检测和分割问题视为从极值区域集中进行的有效顺序选择来实现的。 ER检测器可抵抗模糊，低对比度和照明，颜色和纹理变化。在第一阶段，使用新颖算法在恒定时间内计算出的特征来估计每个ER为字符的概率，而在第二阶段中仅选择局部概率最大的ER，在此阶段，使用计算上更昂贵的特征可以提高分类精度。然后，一种高效的聚类算法将ER分组为文本行，并利用在合成字体上训练的OCR分类器来标记字符区域。当知道每个字符的上下文时，在最后阶段选择最可能的字符序列。该方法在三个公共数据集上进行了评估。在ICDAR 2013数据集上，该方法可实现文本本地化的最新结果；在更具挑战性的SVT数据集上，提出的方法明显优于最新方法，并证明提出的管道可以结合有关检测到的文本的其他先验知识。拟议的方法被用作ICDAR 2015年稳健阅读竞赛的基准，在该技术中，该方法可与最新技术相媲美。

著录项

来源
《IEEE Transactions on Pattern Analysis and Machine Intelligence》 |2016年第9期|1872-1885|共14页
作者
Lukáš Neumann; Jiří Matas;
展开▼
作者单位

Department of Cybernetics, Centre for Machine Perception, Czech Technical University, Praha, Czech Republic;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Text-in-the wild; end-to-end text recognition; photo OCR; scene text;

机译：野外文本;端到端文本识别;照片OCR;场景文本;

相似文献

外文文献
中文文献
专利

1. Real-time localization of multi-oriented text in natural scene images using a linear spatial filter [J] . Girones Xavier, Julia Carme Journal of Real-Time Image Processing . 2020,第5期

机译：使用线性空间滤波器的自然场景图像中多面文本的实时定位
2. Text detection and localization in natural scene images based on text awareness score [J] . Soni Rituraj, Kumar Bijendra, Chand Satish Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2019,第4期

机译：基于文本意识分数的自然场景图像中的文本检测与本地化
3. An effective graph-cut scene text localization with embedded text segmentation [J] . Liu Xiaoqian, Wang Weiqiang Multimedia Tools and Applications . 2015,第13期

机译：具有嵌入式文本分割的有效的图割场景文本本地化
4. Real-time scene text localization and recognition [C] . Neumann Lukas, Matas Jiri Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on . 2012

机译：实时场景文本本地化和识别
5. Unified detection and recognition for reading text in scene images [D] . Weinman, Jerod J. 2008

机译：统一检测和识别以读取场景图像中的文本
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, 2020

机译：草书文本：用于自然场景图像中端到端乌尔都语文本识别的综合数据集
7. 1Real-time Lexicon-free Scene Text Localization and Recognition [O] . 2016

机译：1实时无词典场景文本定位和识别

Real-Time Lexicon-Free Scene Text Localization and Recognition

摘要

著录项

相似文献

相关主题

期刊订阅