首页> 外文会议>2017 IEEE International Conference on Big Knowledge >Modeling Topic Evolution in Social Media Short Texts
【24h】

Modeling Topic Evolution in Social Media Short Texts

机译:社交媒体短文中主题演变的建模

获取原文
获取原文并翻译 | 示例

摘要

Social media short texts like tweets and instant messages provide a lot of valuable information about the hot topics and public opinion. Detecting and tracking topics from these online contents can help people grasp the essential information and its evolution and facilitate many applications. Topic evolution models built based on LDA need to set the topic number manually, which could not change during different time periods and could not be adjusted based on the contents. The nonparametric topic evolution models do not perform very well on short texts due to the data sparsity problem. So in this paper, we propose a nonparametric topic evolution model for short texts. The model uses the recurrent Chinese restaurant process as the prior distribution of topic proportions. Combining it with word co-occurrence modeling, we construct a topic evolution model which is suitable for social media short texts. We carry out experimental studies on twitter dataset. The results show that our method outperforms the baseline methods and could monitor the topic evolution in social media short texts effectively.
机译:诸如推文和即时消息之类的社交媒体短文本提供了有关热门话题和公众舆论的大量宝贵信息。从这些在线内容中检测和跟踪主题可以帮助人们掌握基本信息及其发展,并促进许多应用。基于LDA构建的主题演化模型需要手动设置主题编号,该主题编号在不同时间段内无法更改,并且无法根据内容进行调整。由于数据稀疏性问题,非参数主题演化模型在短文本上效果不佳。因此,在本文中,我们提出了一种针对短文本的非参数主题演化模型。该模型使用递归中餐厅过程作为主题比例的先验分布。结合单词共现模型,我们构建了适合社交媒体短文本的主题演化模型。我们对twitter数据集进行实验研究。结果表明,我们的方法优于基线方法,可以有效地监控社交媒体短文中主题的演变。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号