Journal: Multimedia Tools and Applications

Graphical-character-based shredded Chinese document reconstruction



Abstract

Shredding paper documents is currently a common means of protecting textual information. Because the resulting pieces are numerous and hard to tell apart, reconstructing a shredded document by reversing the operation is challenging. Nevertheless, recovering shredded documents is an important research topic in digital forensics, with broad applicability in information security and judicial investigations. Researchers have proposed various feasible algorithms for restoring shredded documents, but most of them target Western-language documents. Because languages differ substantially, these algorithms are difficult to apply directly to documents in other languages. Chinese is used worldwide and Chinese documents are common, so there is considerable demand for Chinese document reconstruction. This paper presents a complete algorithm for reconstructing shredded Chinese documents. Based on the structural features of the characters, we apply graphics processing to the text on each piece, match the pieces by graph assembling, and thereby restore the shredded document. We test the algorithm on an actual sample, and the experimental results show that the proposed method can effectively restore the shredded document, with an average accuracy of 85.78%. Moreover, the algorithm is highly automated: a human participates only in scanning the pieces, and all other computation steps are completed automatically by the computer.
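
The abstract does not spell out the matching procedure, so the following is only a toy illustration of the general idea of matching pieces by comparing their edges. It is not the paper's graph-assembling method. It assumes vertical strips of a binarized scan (1 = ink, 0 = paper), assumes strokes continue unchanged across a cut, and uses a simple greedy nearest-neighbour chain. The names `Strip`, `edge_cost`, and `greedy_order` are hypothetical.

```python
from typing import List, Optional, Sequence

# Hypothetical representation: a strip is a list of pixel rows, 1 = ink, 0 = paper.
Strip = List[List[int]]


def edge_cost(left: Strip, right: Strip) -> int:
    """Count rows where the right edge of `left` differs from the left edge of `right`."""
    return sum(1 for row_l, row_r in zip(left, right) if row_l[-1] != row_r[0])


def greedy_order(strips: Sequence[Strip]) -> List[int]:
    """Try each strip as the leftmost one, chain the cheapest neighbour each time,
    and return the chain with the lowest total edge cost."""
    best_order: List[int] = []
    best_total: Optional[int] = None
    for start in range(len(strips)):
        order = [start]
        unused = set(range(len(strips))) - {start}
        total = 0
        while unused:
            last = order[-1]
            # Ties are broken by the lower index so the result is deterministic.
            nxt = min(unused, key=lambda j: (edge_cost(strips[last], strips[j]), j))
            total += edge_cost(strips[last], strips[nxt])
            order.append(nxt)
            unused.remove(nxt)
        if best_total is None or total < best_total:
            best_order, best_total = order, total
    return best_order


# A 4x6 toy "page" cut into three 2-pixel-wide strips, listed left to right.
strip_a = [[0, 1], [0, 0], [0, 1], [0, 0]]
strip_b = [[1, 0], [0, 1], [1, 1], [0, 0]]
strip_c = [[0, 1], [1, 1], [1, 1], [0, 1]]

shuffled = [strip_c, strip_a, strip_b]
order = greedy_order(shuffled)  # indices into `shuffled`, left to right
```

On real scans, neighbouring pixel columns are rarely identical, and a greedy chain can lock in an early mistake. That limitation is one reason to exploit character structure, as the abstract proposes.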
