Journal: Journal of Computers
Applications of Text Clustering Based on Semantic Body for Chinese Spam Filtering

Abstract

Statistics-based spam filtering methods perform poorly on new-type spam that uses synonymous substitution and camouflage, because they ignore the semantic relations between words in the text and judge only from the words themselves. This paper therefore proposes a spam filtering method based on the semantic body. The method extracts e-mail features using lexical chains built on HowNet together with the statistics-based TF-IDF method, and handles spam with a text clustering method. The experimental results show that the proposed method is effective at filtering new-type spam.
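
The idea in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the `CONCEPT_OF` table is a hypothetical stand-in for a HowNet lookup, the lexical-chain step is reduced to simple word-to-concept substitution, and the single-pass clustering with a 0.5 cosine threshold is an assumption chosen only for illustration. The sketch shows why two spam messages that differ only by synonymous substitution fall into separate clusters under plain TF-IDF but into the same cluster once words are mapped to shared concepts.

```python
import math
from collections import Counter

# Hypothetical stand-in for a HowNet lookup (assumption, not real HowNet data):
# maps surface words to a shared concept label.
CONCEPT_OF = {
    "cheap": "low_price", "discount": "low_price",
    "buy": "purchase", "order": "purchase",
}

def to_concepts(tokens):
    """Replace each token by its concept label when one is known."""
    return [CONCEPT_OF.get(t, t) for t in tokens]

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()})
    return vectors

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cluster(vectors, threshold=0.5):
    """Single-pass clustering: join the first cluster whose seed is similar enough."""
    clusters = []  # each cluster is a list of document indices; index 0 is the seed
    for i, v in enumerate(vectors):
        for members in clusters:
            if cosine(v, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters

# Toy corpus: the first two messages are the same spam with synonyms swapped.
emails = [
    "cheap watches buy now".split(),
    "discount watches order now".split(),
    "meeting agenda for monday".split(),
    "monday meeting notes attached".split(),
]

raw = cluster(tfidf_vectors(emails))
semantic = cluster(tfidf_vectors([to_concepts(e) for e in emails]))
print("word-level clusters:   ", raw)
print("concept-level clusters:", semantic)
```

On this toy corpus the word-level run keeps the two spam messages apart, while the concept-level run groups them together. A real system would need a proper Chinese word segmenter and an actual HowNet-based similarity measure in place of the lookup table.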