首页> 外文会议>International conference on machine vision >Generation method of synthetic training data for mobile OCR system
【24h】

Generation method of synthetic training data for mobile OCR system

机译:移动ocr系统综合训练数据的生成方法

获取原文

摘要

This paper addresses one of the fundamental problems of machine learning - training data acquiring. Obtaining enough natural training data is rather difficult and expensive. In last years usage of synthetic images has become more beneficial as it allows to save human time and also to provide a huge number of images which otherwise would be difficult to obtain. However, for successful learning on artificial dataset one should try to reduce the gap between natural and synthetic data distributions. In this paper we describe an algorithm which allows to create artificial training datasets for OCR systems using russian passport as a case study.
机译:本文解决了机器学习的基本问题之一-训练数据获取。获得足够的自然训练数据是相当困难且昂贵的。近年来,合成图像的使用变得更加有益,因为它可以节省人的时间并提供大量的图像,否则这些图像将很难获得。但是,为了在人工数据集上成功学习,应该尝试缩小自然数据和合成数据分布之间的差距。在本文中,我们描述了一种算法,该算法允许使用俄罗斯护照为案例研究为OCR系统创建人工训练数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号