Conference: IEEE/ACS International Conference on Computer Systems and Applications

Automatic authorship classification of two ancient books: Quran and Hadith


Abstract

A rigorous, scientific tool for automatic authorship classification has become increasingly important, especially for authenticating ancient documents such as religious or historical books. In this paper, we conduct authorship classification experiments on the Quran and the Hadith to determine whether they could have the same author (i.e., was the Quran written by the Prophet, or only sent down to him, as claimed?). This task, commonly called authorship discrimination, is an important application of authorship classification. It consists of checking whether two texts were written by the same author, using AI (Artificial Intelligence) and TM (Text Mining) techniques. Two main investigations are conducted and presented. In the first, the two books are analyzed as whole texts. In the second, the two books are segmented into 25 text segments: 14 extracted from the Quran and 11 from the Hadith. The segments are roughly equal in size, at approximately 2080 tokens each. Several classifiers are employed: SMO-based Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and Linear Regression (LR). This research has yielded extremely interesting information on the origins of these ancient books.
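
The segmentation step described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the texts were tokenized or how a leftover remainder was handled, so the whitespace tokenization, the merge rule for a short final segment, and the function name `segment_tokens` are all hypothetical.

```python
def segment_tokens(text, segment_size=2080):
    """Split a text into consecutive segments of roughly `segment_size` tokens.

    Assumption: tokens are whitespace-separated; the paper's actual
    tokenization is not specified in the abstract.
    """
    tokens = text.split()
    segments = [
        tokens[i:i + segment_size]
        for i in range(0, len(tokens), segment_size)
    ]
    # Assumption: a short trailing remainder (less than half a segment) is
    # merged into the previous segment so that sizes stay roughly equal.
    if len(segments) > 1 and len(segments[-1]) < segment_size // 2:
        tail = segments.pop()
        segments[-1].extend(tail)
    return segments


if __name__ == "__main__":
    # Synthetic placeholder text, not real corpus data.
    sample = " ".join(f"w{i}" for i in range(4500))
    for index, segment in enumerate(segment_tokens(sample)):
        print(index, len(segment))
```

Each resulting segment would then be turned into a feature vector and passed to a classifier such as an SVM or MLP; that part is omitted here because the abstract does not describe the features used.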
