【24h】

A Document Image Preprocessing System for Keyword Spotting

机译:用于关键词识别的文档图像预处理系统

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a system for the segmentation of a printed document image into word images, which can be used effectively for document image retrieval based on keyword spotting. The system is composed of three image manipulation modules: skew correction, document layout analysis, and word segmentation. To enhance the practical applicability and flexibility of our research results, we test the system with 50 images of Korean papers and 50 images of English papers provided through full-text image retrieval services by the Korea Information Science Society and the Pattern Recognition Society, respectively. Currently, the accuracy of word extraction ranges from 90 to 95%, depending on the language of the document.
机译:本文提出了一种用于将打印文档图像分割为单词图像的系统,该系统可有效地用于基于关键词点标的文档图像检索。该系统由三个图像处理模块组成:偏斜校正,文档布局分析和单词分割。为了提高研究结果的实用性和灵活性,我们分别通过由韩国信息科学学会和模式识别学会提供的全文检索服务提供的50张韩国论文图像和50张英文论文图像对系统进行了测试。当前,取决于文档的语言,单词提取的准确性范围为90%到95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号