首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Leveraging Meta Information in Short Text Aggregation
【24h】

Leveraging Meta Information in Short Text Aggregation

机译:利用短文本聚合中的元信息

获取原文

摘要

Analysing topics in short texts (e.g., tweets and new headings) is a challenging task because short texts often contain insufficient word co-occurrence information, which is important to learn good topics in conventional topic topics. To deal with the insufficiency, we propose a generative model that aggregates short texts into clusters by leveraging the associated meta information. Our model can generate more interpretable topics as well as document clusters. We develop an effective Gibbs sampling algorithm favoured by the fully local conjugacy in the model. Extensive experiments demonstrate that our model achieves better performance in terms of document clustering and topic coherence.
机译:分析短文本的主题(例如,推文和新标题)是一个具有挑战性的任务,因为短篇文本通常包含不足的单词共同信息信息,这对于在传统主题主题中学习良好的主题非常重要。为了处理不足,我们提出了一种通过利用相关的元信息来聚合短文本的生成模型。我们的模型可以生成更多可解释的主题以及文档集群。我们开发了一个有效的GIBBS采样算法,由模型中的完全局部共轭有利。广泛的实验表明,我们的模型在文档聚类和主题一致性方面取得了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号