【24h】

Using Active Learning in Text Classification of Quranic Sciences

机译:在古兰经科学的文本分类中使用主动学习

获取原文
获取原文并翻译 | 示例

摘要

The key idea behind active learning is that if the learning method is allowed to choose the data to learn from, the amount of data needed for the training phase can be significantly reduced. Thus, the cost of manual annotating the data will be less, and the process of learning can be accelerated. Most of the studies on applying active learning methods to automatic text classification focused on requesting the label of a single unlabeled document in each iteration. Unlike English, There are very few researches done in this area for the Arabic text. In this paper, we present a novel active learning method for Arabic text classification using multi-class SVM. The proposed method selects a batch of informative samples for manually labeling by an expert. The experimental results show that employing our method can significantly reduce the need for labeled training data.
机译:主动学习背后的关键思想是,如果允许学习方法选择要学习的数据,则可以显着减少训练阶段所需的数据量。因此,手动注释数据的成本将降低,并且可以加快学习过程。将主动学习方法应用于自动文本分类的大多数研究都集中于在每次迭代中请求单个未标记文档的标记。与英语不同,在该领域对阿拉伯文本的研究很少。在本文中,我们提出了一种使用多类SVM进行阿拉伯文本分类的新型主动学习方法。所提出的方法选择一批信息性样本以供专家手动标记。实验结果表明,采用我们的方法可以大大减少对标记训练数据的需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号