Journal: International Journal of Information Retrieval Research

Towards Building an Arabic Plagiarism Detection System: Plagiarism Detection in Arabic

Abstract

This article describes a plagiarism detection system for Arabic that combines several similarity measures to uncover plagiarism in Arabic documents. The proposed system has two main components: document retrieval and detailed similarity analysis. The document-retrieval component generates queries from a given suspicious document and uses the Google search API to retrieve candidate source documents from the Web. The similarity-analysis component takes each source document in turn and attempts to identify the plagiarized parts of the suspicious document. The proposed system is thoroughly evaluated on an indigenous corpus. At the document-retrieval level, the system achieves an F-score above 75%; at the detailed similarity-computation level, the F-score is above 70%.
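The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say how queries are formed or which similarity measures are combined, so everything below is an assumption for illustration only: fixed-length word chunks stand in for query generation, word 3-gram containment stands in for the similarity analysis, and the Web search call is omitted entirely. The function names (`generate_queries`, `containment`, `f_score`) are hypothetical and not taken from the article.

```python
import re
from typing import List, Set, Tuple


def tokenize(text: str) -> List[str]:
    # \w matches Unicode letters, so Arabic words are kept whole.
    # Assumption: the text is undiacritized; no stemming or
    # normalization is applied in this sketch.
    return re.findall(r"\w+", text.lower())


def generate_queries(text: str, words_per_query: int = 8) -> List[str]:
    # Hypothetical query generation: cut the suspicious document into
    # consecutive chunks of words, one search query per chunk.
    tokens = tokenize(text)
    return [
        " ".join(tokens[i:i + words_per_query])
        for i in range(0, len(tokens), words_per_query)
    ]


def shingles(tokens: List[str], n: int = 3) -> Set[Tuple[str, ...]]:
    # The set of all word n-grams in a token sequence.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def containment(suspicious: str, source: str, n: int = 3) -> float:
    # Fraction of the suspicious document's n-grams that also occur
    # in the candidate source document (0.0 if there are no n-grams).
    susp = shingles(tokenize(suspicious), n)
    src = shingles(tokenize(source), n)
    return len(susp & src) / len(susp) if susp else 0.0


def f_score(precision: float, recall: float) -> float:
    # Harmonic mean of precision and recall (the balanced F-score).
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

In a full system, each string returned by `generate_queries` would be sent to a search service, and `containment` would be computed between the suspicious document and every retrieved candidate. The F-score is the harmonic mean of precision and recall; whether the article weights the two equally is not stated, so the balanced form is assumed here.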
