...
首页> 外文期刊>Review of the National Center for Digitization >Algorithm for Document Authorship Identification and Plagiarism Evaluation Based on Generalized Suffix Tree
【24h】

Algorithm for Document Authorship Identification and Plagiarism Evaluation Based on Generalized Suffix Tree

机译:基于广义后缀树的文献作者识别与抄袭评估的算法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Identifying an author of an anonymous text document is an important problem when dealing with historical data. As authors have their own characteristic writing styles, expressed through specific phrases, sentence constructions or word choices, their text documents incorporate the style and create implicit connection with the author. This paper proposes an approach for identification of authors of the anonymous documents, based on generalized suffix tree data structure and defined similarity score, suitable for analysis of digitized historical text documents. The following method can also be used for detecting and evaluating plagiarism, where the document author is known, but the document shows a high similarity with documents from another author.
机译:识别匿名文本文档的作者是处理历史数据时的重要问题。 由于作者具有自己的特征写作样式,通过特定短语,句子结构或单词选择表示,他们的文本文档包含了样式并与作者创建隐式连接。 本文提出了一种基于广义后缀树数据结构和定义的相似度分数的匿名文档识别作者的方法,适用于分析数字化历史文本文档。 以下方法还可用于检测和评估抄袭,其中已知文档作者,但文件显示了来自另一作者的文档的高相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号