首页> 外文期刊>Chinese Journal of Electronics >Rejecting Character Recognition Errors Using CNN Based Confidence Estimation
【24h】

Rejecting Character Recognition Errors Using CNN Based Confidence Estimation

机译:使用基于CNN的置信度估计来拒绝字符识别错误

获取原文
获取原文并翻译 | 示例
           

摘要

Although Optical character recognition (OCR) technology has achieved huge progress in recent years, character misrecognition is inevitable. In order to realize high fidelity content of document digitalization, we propose a new Convolutional neural networks (CNN) based confidence estimation method. We detect the misrecognized characters through comparing the confidence value with a preset threshold, so as to leave the recognition errors as embedded images in the output digital documents. We adopted sofmax as the estimation of posteriori probability, overlap pooling and maxout with dropout technologies in CNN architecture design. Experimental results show that our method has achieved an explicit improvement compared to baseline system.
机译:尽管光学字符识别(OCR)技术近年来已经取得了巨大的进步,但是字符错误识别是不可避免的。为了实现文档数字化的高保真内容,我们提出了一种新的基于卷积神经网络(CNN)的置信度估计方法。我们通过将置信度值与预设阈值进行比较来检测无法识别的字符,从而将识别错误作为嵌入图像保留在输出数字文档中。在CNN架构设计中,我们采用sofmax作为后验概率,重叠池和maxout以及dropout技术的估计。实验结果表明,与基线系统相比,我们的方法取得了明显的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号