Semi-supervised Learning For Detecting Text-lines in Noisy Document Images

机译：半监督学习，用于检测嘈杂文档图像中的文本行

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document layout analysis is a key step in document image understanding with wide applications in document digitization and reformatting. Identifying correct layout from noisy scanned images is especially challenging. In this paper, we introduce a semi-supervised learning framework to detect text-lines from noisy document images. Our framework consists of three steps. The first step is the initial segmentation that extracts text-lines and images using simple morphological operations. The second step is a grouping-based layout analysis that identifies text-lines, image zones, column separator and vertical border noise. It is able to efficiently remove the vertical border noises from multi-column pages. The third step is an online classifier that is trained with the high confidence line detection results from Step Two, and filters out noise from low confidence lines. The classifier effectively removes speckle noises embedded inside the content zones.rnWe compare the performance of our algorithm to the state-of-the-art work in the field on the UW-III database. We choose the results reported by the Image Understanding Pattern Recognition Research (IUPR) and Scansoft Omnipage SDK 15.5. We evaluate the performances at both the page frame level and the text-line level. The result shows that our system has much lower false-alarm rate, while maintains similar content detection rate. In addition, we also show that our online training model generalizes better than algorithms depending on offline training.

机译：文档布局分析是文档图像理解的关键步骤，在文档数字化和重新格式化方面具有广泛的应用。从嘈杂的扫描图像中识别正确的布局尤其具有挑战性。在本文中，我们介绍了一种半监督学习框架，可从嘈杂的文档图像中检测文本行。我们的框架包括三个步骤。第一步是使用简单的形态学操作提取文本行和图像的初始分割。第二步是基于分组的布局分析，可识别文本行，图像区域，列分隔符和垂直边框噪声。它能够有效地消除多列页面的垂直边框噪声。第三步是在线分类器，使用第二步中的高置信度线检测结果对其进行训练，并过滤掉低置信度线中的噪声。该分类器有效地消除了嵌入在内容区域内的斑点噪声。我们将算法的性能与UW-III数据库中该领域的最新技术进行了比较。我们选择图像理解模式识别研究（IUPR）和Scansoft Omnipage SDK 15.5报告的结果。我们评估页面框架级别和文本行级别的性能。结果表明，我们的系统具有较低的误报率，同时保持了相似的内容检测率。此外，我们还表明，根据离线培训，我们的在线培训模型比算法具有更好的泛化能力。

著录项

来源
《Document recognition and retrieval XVII》|2010年|P.75340C.1-75340C.10|共10页
会议地点 San Jose CA(US);San Jose CA(US);San Jose CA(US)
作者
Zongyi Liu; rnHarming Zhou;
展开▼
作者单位

Amazon.com, Fifth Avenue Suite 1500, Seattle, WA 98104;

rnAmazon.com, Fifth Avenue Suite 1500, Seattle, WA 98104;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类情报检索;
关键词
text-line detection; document segmentation; semi-supervised learning;

机译：文本行检测；文件分割半监督学习;

相似文献

外文文献
中文文献
专利

1. Un-supervised and semi-supervised hand segmentation in egocentric images with noisy label learning [J] . Li Yinlin, Jia Lihao, Wang Zidong, Neurocomputing . 2019,第MARa21期

机译：带有噪声标签学习的自我中心图像中的无监督和半监督手分割
2. Semi-supervised learning for text-line detection [J] . Zongyi Liu, rnHanning Zhou, rnNing Yang Pattern recognition letters . 2010,第11期

机译：文本行检测的半监督学习
3. Text-line extraction from handwritten document images using GAN [J] . Kundu Soumyadeep, Paul Sayantan, Bera Suman Kumar, Expert Systems with Application . 2020,第Feba期

机译：使用GAN从手写文档图像中提取文本行
4. Semi-supervised Learning For Detecting Text-lines in Noisy Document Images [C] . Zongyi Liu, Harming Zhou Electronic Imaging Science and Technology Symposium . 2010

机译：用于检测嘈杂文档图像中的文本线路的半监督学习
5. Information Preserving Processing of Noisy Handwritten Document Images [D] . Chen, Jin 2015

机译：嘈杂的手写文档图像的信息保存处理
6. Federated Semi-Supervised Multi-Task Learning to Detect COVID-19 and Lungs Segmentation Marking Using Chest Radiography Images and Raspberry Pi Devices: An Internet of Medical Things Application [O] . Mahbub Ul Alam, Rahim Rahmani 2021

机译：联邦半监督的多任务学习用胸部射线照相图像和覆盆子PI器件检测Covid-19和肺部分割标记：应用程序互联网
7. Spotting separator points at line terminals in compressed document images for text-line segmentation [O] . Amarnath R., Nagabhushan P. 2017

机译：在压缩文档图像中的行终端处发现分隔符点，以进行文本行分割

Semi-supervised Learning For Detecting Text-lines in Noisy Document Images

摘要

著录项

相似文献

相关主题

期刊订阅