Text line extraction from handwritten document pages based on line contour estimation

机译：基于线条轮廓估计的手写文档页面文本行提取

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Extraction of text lines from handwritten/printed document images is one of the important steps in the process of an Optical Character Recognition (OCR) system. In case of handwritten document images, presence of skewed, touching or overlapping text line(s) makes this process a real challenge to the researcher. In the present work, a new text line extraction technique based on line contour estimation is reported. Here, digitized document image is initially partitioned into a number of vertical fragments of equal width. Then all the line segments present in these vertical fragments are detected. Finally, the neighboring line segments are analyzed to place them inside the line boundary in which they actually belong. For experimental purpose, the developed technique is tested on CMATERdb1.2.1 database and present technique extracts 88.44% text lines successfully.

机译：从手写/打印的文档图像中提取文本行是光学字符识别（OCR）系统过程中的重要步骤之一。对于手写文档图像，歪斜，触摸或重叠的文本行的存在使该过程成为研究人员的真正挑战。在当前的工作中，报告了一种新的基于行轮廓估计的文本行提取技术。在这里，数字化文档图像最初被划分为多个等宽的垂直片段。然后检测这些垂直片段中存在的所有线段。最后，分析相邻的线段以将其放置在它们实际所属的线边界内。出于实验目的，在CMATERdb1.2.1数据库上测试了开发的技术，该技术成功提取了88.44％的文本行。

著录项

来源
《2012 Third International Conference on Computing Communication amp; Networking Technologies.》|2012年|p.1- 8|共8页
会议地点 Coimbatore(IN);Coimbatore(IN)
作者
Sarkar Ram; Halder Sougata; Malakar Samir; Das Nibaran; Basu Subhadip; Nasipuri Mita;
展开▼
作者单位

Dept. of Computer Science and Engineering, Jadavpur University, Kolkata, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类通信网;通信网;
关键词

相似文献

外文文献
中文文献
专利

1. Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework [J] . Koo H. I., Cho N. I. Image Processing, IEEE Transactions on . 2012,第3期

机译：基于能量最小化框架的手写中文文档文本行提取
2. Coalition game based feature selection for text non-text separation in handwritten documents using LBP based features [J] . Manosij Ghosh, Kushal Kanti Ghosh, Showmik Bhowmik, Multimedia Tools and Applications . 2021,第2期

机译：基于联盟游戏的特征选择，用于使用基于LBP的功能手写文档中的文本非文本分离
3. Text/Non-Text Separation from Handwritten Document Images Using LBP Based Features: An Empirical Study [J] . Sourav Ghosh, Dibyadwati Lahiri, Showmik Bhowmik, Journal of Imaging . 2018,第4期

机译：使用基于LBP的功能从手写文档图像中分离文本/非文本的实证研究
4. Text line extraction from handwritten document pages based on line contour estimation [C] . Sarkar Ram, Halder Sougata, Malakar Samir, International Conference on Computing, Communication and Networking Technologies . 2012

机译：基于线路轮廓估计的手写文档页面文本排列
5. Document image analysis techniques for handwritten text segmentation, document image rectification and digital collation. [D] . Salvi, Dhaval. 2014

机译：用于手写文本分割，文档图像校正和数字整理的文档图像分析技术。
6. ASM Based Synthesis of Handwritten Arabic Text Pages [O] . Laslo Dinges, Ayoub Al-Hamadi, Moftah Elzobi, 2015

机译：基于ASM的阿拉伯语手写文本页面综合
7. Contour vs Non-Contour based Word Segmentation from Handwritten Text Lines: an Experimental Analysis [O] . Fajri Kurniawan*corresponding, Amjad Rehman Khan, Dzulkifli Mohamad 2014

机译：基于手势文本行的轮廓与非基于轮廓的分词：实验分析

Text line extraction from handwritten document pages based on line contour estimation

摘要

著录项

相似文献

相关主题

期刊订阅