Toward a Complex System for Context Discovery to Index Arabic Documents

Mohamed Salim El Bazzi; Driss Mammass; Abdelatif Ennaji; Taher Zaki

首页> 外文期刊>Journal of Computers >Toward a Complex System for Context Discovery to Index Arabic Documents

【24h】

Toward a Complex System for Context Discovery to Index Arabic Documents

机译：朝着一个复杂的系统，以获取对索引阿拉伯文档的上下文发现

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text indexing aims to take the full advantage of textual data to help intelligent programs to make relevant decisions. In order to explore a large amount of textual documents, and to disclose semantic information hidden in unstructured documents, like texts, an effective indexation system is required. In this paper, we propose a new approach for indexing Arabic texts. Based on the semantic proximity and taking into account the contexts contained in each document, our method is denoted contextual indexing. Several algorithms are used for keywords extraction, each of them emphasizes some criterion. However, we target the most descriptive keywords for each document. We also propose a new approach for document modeling. We compared the results obtained using our method with those obtained by an indexation system based on a standard statistical method. The experimental results demonstrate the performance of our approach.

机译：文本索引旨在充分利用文本数据，以帮助智能计划进行相关的决策。为了探索大量的文本文档，并披露隐藏在非结构化文档中的语义信息，如文本，需要有效的索引系统。在本文中，我们提出了一种索引阿拉伯语文本的新方法。基于语义接近并考虑到每个文档中包含的上下文，我们的方法是表示上下文索引。几种算法用于关键字提取，每个算法都强调一些标准。但是，我们针对每个文档的最具描述性关键字。我们还提出了一种新的文档建模方法。我们比较了使用我们的方法获得的结果，其中基于标准统计方法通过分度系统获得的结果。实验结果表明了我们方法的表现。

著录项

来源
《Journal of Computers》 |2018年第8期|共8页
作者
Mohamed Salim El Bazzi; Driss Mammass; Abdelatif Ennaji; Taher Zaki;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Visualization Systems Supporting the Reading of Arabic Document for non Arabic Speakers [J] . R. J. R. Yusof, R. Zainuddin, M. S. Baba, Information Technology Journal . 2009,第1期

机译：支持非阿拉伯语阅读者阅读阿拉伯语文档的可视化系统
2. Visualization Systems Supporting the Reading of Arabic Document for non Arabic Speakers [J] . R.J.R. Yusof, R. Zainuddin, M.S. Baba, Information Technology Journal . 2009,第1期

机译：支持非阿拉伯语阅读者阅读阿拉伯语文档的可视化系统
3. Plagiarism Detection in Arabic Documents: Approaches, Architecture and Systems [J] . Boubaker Kahloula, Jawad Berri Journal of digital information management . 2016,第2期

机译：阿拉伯文档中的抄袭检测：方法，体系结构和系统
4. Physical Layout Analysis of Complex Structured Arabic Documents Using Artificial Neural Nets [C] . Karim Hadjar, Rolf Ingold IAPR Workshop on Document Analysis Systems . 2004

机译：使用人工神经网的复杂结构阿拉伯文文件的物理布局分析
5. Complex system contextual framework (CSCF): A grounded-theory construction for the articulation of system context in addressing complex systems problems. [D] . Crownover, W. B. Max. 2005

机译：复杂系统上下文框架（CSCF）：用于解决复杂系统问题的系统上下文表达的基础理论构建。
6. A Fast Document Classification Algorithm for Gene Symbol Disambiguation in the BITOLA Literature-Based Discovery Support System [O] . Andrej Kastrin, Dimitar Hristovski 2008

机译：基于BITOLA文献的发现支持系统中用于基因符号消除歧义的快速文档分类算法
7. Toward a Complex System for Context Discovery to Index Arabic Documents [O] . Mohamed Salim El Bazzi 2018

机译：朝着一个复杂的系统，以获取对索引阿拉伯文档的上下文发现

Toward a Complex System for Context Discovery to Index Arabic Documents

摘要

著录项

相似文献

相关主题

期刊订阅