>With the overflowing of Short Message Service (SMS) spam nowadays, many traditional t'/> Bi-Term Topic Model for SMS Classification
【24h】

Bi-Term Topic Model for SMS Classification

机译:SMS分类的双向术语模型

获取原文
获取原文并翻译 | 示例
           

摘要

>With the overflowing of Short Message Service (SMS) spam nowadays, many traditional text classification algorithms are used for SMS spam filtering. Nevertheless, because the content of SMS spam messages are miscellaneous and distinct from general text files, such as more shorter, usually including mass of abbreviations, symbols, variant words and distort or deform sentences, the traditional classifiers aren't fit for the task of SMS spam filtering. In this paper, the authors propose a Short Message Biterm Topic Model (SM-BTM) which can be used to automatically learn latent semantic features from SMS spam corpus for the task of SMS spam filtering. The SM-BTM is based on the probability of topic model theory and Biterm Topic Model (BTM). The experiments in this work show the proposed model SM-BTM can acquire higher quality of topic features than the original BTM, and is more suitable for identifying the miscellaneous SMS spam.
机译: >随着短消息服务(SMS)垃圾邮件如今,许多传统的文本分类算法已用于SMS垃圾邮件过滤。但是,由于SMS垃圾邮件的内容是杂项,并且与一般的文本文件不同,例如更短,通常包括大量的缩写,符号,变体词以及歪曲或变形的句子,因此传统分类器不适合SMS垃圾邮件过滤。在本文中,作者提出了一种短消息双项主题模型(SM-BTM),该模型可用于自动从SMS垃圾邮件语料库中学习潜在的语义特征,以完成SMS垃圾邮件过滤的任务。 SM-BTM基于主题模型理论和Biterm主题模型(BTM)的概率。这项工作中的实验表明,与原始BTM相比,所提出的SM-BTM模型具有更高的主题特征质量,并且更适合于识别其他SMS垃圾邮件。

著录项

  • 来源
  • 作者单位

    The Laboratory for Internet of Things and Mobile Internet Technology of Jiangsu Province, Huaiyin Institute of Technology, Huaian, China & College of Computer and Information, Hohai University, Nanjing, China;

    Faculty of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian, China & College of Computer and Information, Hohai University, Nanjing, China;

    Faculty of Management Engineering, Huaiyin Institute of Technology, Huaian, China;

    Faculty of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian, China;

    Faculty of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    BTM; Collaborative Information System; LDA; SM-BTM; SMS Spam; Topic Model;

    机译:BTM;协作信息系统;LDA;SM-BTM;SMS垃圾邮件;主题模型;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号