...
首页> 外文期刊>Journal of Computers >Toward a Complex System for Context Discovery to Index Arabic Documents
【24h】

Toward a Complex System for Context Discovery to Index Arabic Documents

机译:朝着一个复杂的系统,以获取对索引阿拉伯文档的上下文发现

获取原文
           

摘要

Text indexing aims to take the full advantage of textual data to help intelligent programs to make relevant decisions. In order to explore a large amount of textual documents, and to disclose semantic information hidden in unstructured documents, like texts, an effective indexation system is required. In this paper, we propose a new approach for indexing Arabic texts. Based on the semantic proximity and taking into account the contexts contained in each document, our method is denoted contextual indexing. Several algorithms are used for keywords extraction, each of them emphasizes some criterion. However, we target the most descriptive keywords for each document. We also propose a new approach for document modeling. We compared the results obtained using our method with those obtained by an indexation system based on a standard statistical method. The experimental results demonstrate the performance of our approach.
机译:文本索引旨在充分利用文本数据,以帮助智能计划进行相关的决策。为了探索大量的文本文档,并披露隐藏在非结构化文档中的语义信息,如文本,需要有效的索引系统。在本文中,我们提出了一种索引阿拉伯语文本的新方法。基于语义接近并考虑到每个文档中包含的上下文,我们的方法是表示上下文索引。几种算法用于关键字提取,每个算法都强调一些标准。但是,我们针对每个文档的最具描述性关键字。我们还提出了一种新的文档建模方法。我们比较了使用我们的方法获得的结果,其中基于标准统计方法通过分度系统获得的结果。实验结果表明了我们方法的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号