首页> 外文会议>International Conference on Electronic, Mechanical, Information and Management >Microblogging Short Text Classification based on Word2Vec
【24h】

Microblogging Short Text Classification based on Word2Vec

机译:基于Word2Vec的微博短文本分类

获取原文

摘要

For the sparse features of the microblogging text, the author proposes a method of microblogging text classification based on the features extension by Word2Vec. We train the text by using Word2Vec tool and find the words which are similar to original features semantic as the features of short text. Then we expand the features to the original text, and finally classify the subject of microblogging text by using SVM method. Experimental results show that the method has high accuracy recall and F1 values compared with the traditional method of vector space model and LDA topic model.
机译:对于微博文本的稀疏特征,作者提出了一种基于Word2Vec的功能扩展的微博文本分类方法。我们使用Word2Vec工具培训文本,并找到与原始特征语义类似的单词作为短文本的功能。然后我们将功能扩展到原始文本,最后使用SVM方法对微博文本的主题进行分类。实验结果表明,与传统的矢量空间模型和LDA主题模型相比,该方法具有高精度召回和F1值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号