...
首页> 外文期刊>Knowledge-Based Systems >MDLText: An efficient and lightweight text classifier
【24h】

MDLText: An efficient and lightweight text classifier

机译:MDLText:高效轻量的文本分类器

获取原文
获取原文并翻译 | 示例
           

摘要

In many areas, the volume of text information is increasing rapidly, thereby demanding efficient text classification approaches. Several methods are available at present, but most exhibit declining performance as the dimensionality of the problem increases, or they incur high computational costs for training, which limit their application in real scenarios. Thus, it is necessary to develop a method that can process high dimensional data in a rapid manner. In this study, we propose the MDLText, an efficient, lightweight, scalable, and fast multinomial text classifier, which is based on the minimum description length principle. MDLText exhibits fast incremental learning as well as being sufficiently robust to prevent overfitting, which are desirable features in real-world applications, large-scale problems, and online scenarios. Our experiments were carefully designed to ensure that we obtained statistically sound results, which demonstrated that the proposed approach achieves a good balance between predictive power and computational efficiency. (C) 2016 Elsevier B.V. All rights reserved.
机译:在许多领域,文本信息的数量正在迅速增加,因此需要有效的文本分类方法。当前有几种方法可用,但是随着问题的维数增加,大多数方法的性能都会下降,或者它们会带来很高的训练计算成本,这限制了它们在实际场景中的应用。因此,有必要开发一种可以快速处理高维数据的方法。在这项研究中,我们提出了MDLText,这是一种基于最小描述长度原则的高效,轻量,可伸缩和快速的多项式文本分类器。 MDLText具有快速的增量学习功能,并且具有足够的鲁棒性以防止过度拟合,这是现实应用程序,大规模问题和在线方案中的理想功能。我们精心设计了实验,以确保我们获得统计上合理的结果,这表明所提出的方法在预测能力和计算效率之间取得了良好的平衡。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号