首页> 外文期刊>Neurocomputing >Deep learning for detection of text polarity in natural scene images
【24h】

Deep learning for detection of text polarity in natural scene images

机译:在自然场景图像中检测文本极性的深度学习

获取原文
获取原文并翻译 | 示例
           

摘要

Text extraction and recognition from natural scene images is a challenging task due to their complex background. It has several computer vision applications like license plate recognition, content based image retrieval, digitization for visually impaired etc. In these images, dark text can be present on a bright background or vice versa and there is an imperative need to determine this polarity for the recognition process. In the present work, we have proposed to use deep learning approaches to determine text polarity. We have used Convolutional Neural Network (CNN) to classify whether a scene image contains dark text on a bright background or vice versa. CNN has been trained on image samples collected from benchmarking datasets like ICDAR, IIIT5K etc. We have also extracted CNN features by removing its final fully connected layers and trained support vector machine (SVM) classifier using these features. Our experiments have shown that this transfer learning approach has given better accuracy than original CNN and the corresponding results are reported. (C) 2020 Elsevier B.V. All rights reserved.
机译:由于他们的复杂背景,自然场景图像的文本提取和识别是一个具有挑战性的任务。它具有许多计算机视觉应用,如牌照识别,基于内容的图像检索,数字化在视觉上受损等。在这些图像中,暗文本可以存在于明亮的背景上,反之亦然,并且必须需要确定这种极性的必要性识别过程。在目前的工作中,我们提出使用深度学习方法来确定文本极性。我们使用了卷积神经网络(CNN)来分类场景图像是否包含明亮的背景上的黑暗文本,反之亦然。 CNN已经在从基准数据集中收集的图像样本培训,如ICDAR,IIIT5K等。我们还通过删除其最终完全连接的图层并使用这些功能培训支持向量机(SVM)分类器来提取CNN特征。我们的实验表明,该转移学习方法具有比原始CNN更好的准确性,并报告了相应的结果。 (c)2020 Elsevier B.v.保留所有权利。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号