首页> 外文期刊>Abstract and applied analysis >A Solution to Reconstruct Cross-Cut Shredded Text Documents Based on Character Recognition and Genetic Algorithm
【24h】

A Solution to Reconstruct Cross-Cut Shredded Text Documents Based on Character Recognition and Genetic Algorithm

机译:基于字符识别和遗传算法的交叉剪切文本文档重构解决方案

获取原文
           

摘要

The reconstruction of destroyed paper documents is of more interest during the last years. This topic is relevant to the fields of forensics, investigative sciences, and archeology. Previous research and analysis on the reconstruction of cross-cut shredded text document (RCCSTD) are mainly based on the likelihood and the traditional heuristic algorithm. In this paper, a feature-matching algorithm based on the character recognition via establishing the database of the letters is presented, reconstructing the shredded document by row clustering, intrarow splicing, and interrow splicing. Row clustering is executed through the clustering algorithm according to the clustering vectors of the fragments. Intrarow splicing regarded as the travelling salesman problem is solved by the improved genetic algorithm. Finally, the document is reconstructed by the interrow splicing according to the line spacing and the proximity of the fragments. Computational experiments suggest that the presented algorithm is of high precision and efficiency, and that the algorithm may be useful for the different size of cross-cut shredded text document.
机译:在过去的几年中,重建被销毁的纸质文件更加令人关注。该主题与法医学,调查科学和考古学领域相关。以往对交叉切割文本文档(RCCSTD)重建的研究和分析主要基于似然法和传统启发式算法。提出了一种基于字符识别的特征匹配算法,该算法通过建立字母数据库,通过行聚类,行内拼接和行内拼接来重建切碎的文档。行聚类是根据片段的聚类向量通过聚类算法执行的。通过改进的遗传算法解决了行内拼接被视为旅行商的问题。最后,根据行间距和片段的接近程度,通过行间拼接来重建文档。计算实验表明,该算法具有较高的精度和效率,对于不同尺寸的横切文本文件可能有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号