【24h】

A Rule-Based Annotation System to Extract Tajweed Rules from Quran

机译:基于规则的注释系统,用于从古兰经中提取Tajweed规则

获取原文

摘要

Quran Recitation relies on identifying and applying different Tajweed rules [T??ú? ?áêì?í?] such as Muddud [????] and Tanween [ê??í?] in the Quran text. This research is aimed at providing a tool that automatically finds and annotates letters that embody Tajweed rules in Quran text. This field remains an open research area due to the lack of open source NLP tools that support the Arabic language. Applying Natural Language Processing (NLP) techniques on Quran text to extract Tajweed letters is considered an important Information Extraction (IE) step. This research explores the field of applying IE techniques on Quran text. Rule based IE techniques are well known to achieve optimal results. This research explores NLP techniques on Quranic text using GATE, an open source flexible NLP environment. GATE is employed for this research to build the application that processes un-annotated Quranic text corpus. The developed application is evaluated using the well known IE evaluation metrics precision and recall. By comparing the system's automatically annotated text with a gold standard (i.e. Quran text). The system proved to be efficient by achieving 100% precision and recall of the implemented Tajweed rules.
机译:古兰经叙述依赖于识别和应用不同的塔赫行规则[T ??ú? ?áêì?í?]如muddud [????]和tanween [ê??í?]在古兰经文本中。该研究旨在提供一个自动发现和注释在古兰经文本中体现Tajweed规则的字母的工具。由于缺乏支持阿拉伯语的开源NLP工具,此字段仍然是开放的研究区。在古兰经文本上应用自然语言处理(NLP)技术提取Tajweed字母被认为是一个重要的信息提取(即)步骤。本研究探讨了古兰经文本上应用IE技术的领域。基于规则的IE技术是众所周知的,可以实现最佳结果。这项研究探讨了使用门的古兰经文本的NLP技术,开源灵活的NLP环境。门用于该研究以构建处理未注释的QURANIC文本语料库的应用程序。使用众所周知的IE评估度量精度和召回评估开发的应用。通过将系统自动注释的文本与金标准(即古兰经文本)进行比较。通过实现100%的精确和召回实施的拖拉规则,该系统证明是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号