首页> 外文会议>International Conference on Inventive Computing and Informatics >A simple system for table extraction irrespective of boundary thickness and removal of detected spurious lines
【24h】

A simple system for table extraction irrespective of boundary thickness and removal of detected spurious lines

机译:用于表提取的简单系统,无论边界厚度和检测到的杂散线的去除如何

获取原文

摘要

Several types of table layout structures are ubiquitous in digitalized document images and are characterized by their row and column separators. Document image may consist of several undesirable lines introduced due to improper scanning, crease formation, accidental remarks etc., in addition to the desired lines in tables. Since tables being an effective component of document images for representing an information, one needs to extract the table from the document images. The proposed method aims at removing unwanted straight lines in binary document images, without affecting the essential details of the table by a two-step process. In the first step, the extraction of necessary details of the tables containing lines as row and column separators along with their respective frames is performed using Mask Processing. The second step involves the detection and removal of all straight lines using a Pseudo Diagonal Image (PDI) and its rotation. The proposed method exploits the novelty in utilizing a single mask for the detection of tables instead of multiple masks, hence the computational complexity for processing is lesser. Independency in the thickness of table boundary while extraction is also an effective characterization of the proposed algorithm. The Obtained result shows 93.35% precision and 92.33% recall.
机译:在数字化文档图像中,几种类型的表布局结构是普遍存在的,其特征在于它们的行和柱分离器。除了表格中的所需线之外,文档图像可以包括由于扫描,折痕形成,意外言论等而引入的几条不良线。由于表是用于表示信息的文档图像的有效组件,因此需要从文档图像中提取表。所提出的方法旨在在二进制文件图像中去除不需要的直线,而不会通过两步过程影响表的基本细节。在第一步中,使用掩模处理执行作为行和柱分离器的包含行的表的必要细节以及它们各自的帧。第二步涉及使用伪对角线图像(PDI)及其旋转来检测和移除所有直线。所提出的方法利用单个掩模来利用用于检测表而不是多个掩模的单个掩模来利用新颖性,因此处理的计算复杂度是较小的。在提取的同时表边界厚度的独立性也是所提出的算法的有效表征。所获得的结果显示93.35 %精度和92.33 %召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号