A Unified Algorithm for Identification of Various Tabular Structures from Document Images

Sekhar Mandal; Amit K. Das; Partha Bhowmick; Bhabatosh Chanda

首页> 外文期刊>International journal of digital library systems >A Unified Algorithm for Identification of Various Tabular Structures from Document Images

【24h】

A Unified Algorithm for Identification of Various Tabular Structures from Document Images

机译：用于从文档图像中识别各种表格结构的统一算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a unified algorithm for segmentation and identification of various tabular structures from document page images. Such tabular structures include conventional tables and displayed math-zones, as well as Table of Contents (TOC) and Index pages. After analyzing the page composition, the algorithm initially classifies the input set of document pages into tabular and non-tabular pages. A tabular page contains at least one of the tabular structures, whereas a non-tabular page does not contain any. The appmach is unified in the sense that it is able to identify all tabular structures from a tabular page, which leads to a considerable simplification of document image segmentation in a novel manner. Such unification also results in speeding up the segmentation process, because the existing methodologies produce time-consuming solutions for treating different tabular structures as separate physical entities. Distinguishing features of different kinds of tabular structures have been used in stages in order to ensure the simplicity and efficiency of the algorithm and demonstrated by exhaustive experimental results.

机译：本文提出了一种用于从文档页面图像中分割和识别各种表格结构的统一算法。这样的表格结构包括常规表格和显示的数学区域，以及目录（TOC）和索引页面。在分析页面组成之后，该算法首先将文档页面的输入集分类为表格和非表格页面。表格页面包含至少一个表格结构，而非表格页面不包含任何表格结构。 appmach在某种意义上是统一的，因为它能够从表格页面识别所有表格结构，从而以新颖的方式大大简化了文档图像分割。这种统一还导致加速了分割过程，因为现有的方法学产生了耗时的解决方案来将不同的表格结构视为单独的物理实体。为了确保算法的简单性和效率，已经分阶段使用了不同类型的表格结构的区别特征，并通过详尽的实验结果进行了证明。

著录项

来源
《International journal of digital library systems》 |2011年第6期|p.27-54|共28页
作者
Sekhar Mandal; Amit K. Das; Partha Bhowmick; Bhabatosh Chanda;
展开▼
作者单位

Bengal Engineering and Science University, Shibpur, India;

Bengal Engineering and Science University, Shibpur, India;

Indian Institute of Technology Kharagpur, India;

Indian Statistical Institute, Kolkata, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
document image segmentation; index page detection; math-zone detection; table detection; tabular structures; TOC detection;

机译：文档图像分割;索引页检测;数学区域检测;表检测;板状结构;TOC检测;

相似文献

外文文献
中文文献
专利

1. Fourier-Mellin registration of line-delineated tabular document images [J] . Luke A. D. Hutchison, William A. Barrett International Journal on Document Analysis and Recognition . 2006,第2a3期

机译：线描表格文档图像的傅里叶-梅林配准
2. Holistic design for deep learning-based discovery of tabular structures in datasheet images [J] . Ertugrul Kara, Mark Traquair, Murat Simsek, Engineering Applications of Artificial Intelligence . 2020,第Apra期

机译：基于深度学习的整体设计，可发现数据表图像中的表格结构
3. A unified framework for improving the accuracy of all holistic face identification algorithms Electoral College for human face identification by computing machinery [J] . Liang Chen, Naoyuki Tokuda Artificial Intelligence Review: An International Science and Engineering Journal . 2010,第1a2期

机译：一个提高所有整体人脸识别算法准确性的统一框架选举学院通过计算机器进行人脸识别
4. A Complete System for Detection and Identification of Tabular Structures from Document Images [C] . S. Mandal, S.P. Chowdhury, A.K. Das, International Conference on Image Analysis and Recognition(ICIAR 2004) pt.2; 20040929-1001; Porto(PT) . 2004

机译：从文档图像中检测和识别表格结构的完整系统
5. Camera/projector-based document/object capture system using structured light: Reflectance map image quality assessment and design of structured light patterns and analysis algorithms. [D] . Lei, Yang. 2014

机译：使用基于结构化光的基于摄像机/投影仪的文档/物体捕获系统：反射率图图像质量评估以及结构化光图案和分析算法的设计。
6. A Unified Level Set Framework Combining Hybrid Algorithms for Liver and Liver Tumor Segmentation in CT Images [O] . Zhou Zheng, Xuechang Zhang, Huafei Xu, -1

机译：结合混合算法的统一水平集框架用于CT图像中的肝和肝肿瘤分割
7. Detection and identification of elliptical structure arrangements in images: theory and algorithms [O] . Pătrăucean Viorica 2012

机译：图像中椭圆结构排列的检测和识别：理论和算法
8. New Graph Models and Algorithms for Detecting Salient Structures from Cluttered Images [R] . Wang, S. 2010

机译：用于从杂波图像中检测显着结构的新图模型和算法

A Unified Algorithm for Identification of Various Tabular Structures from Document Images

摘要

著录项

相似文献

相关主题

期刊订阅