首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >Topics may Evolve: Using Complaint Data for Analysis
【24h】

Topics may Evolve: Using Complaint Data for Analysis

机译:主题可能会演变:使用投诉数据进行分析

获取原文

摘要

User complaint data are quite valuable because they can reflect deficiencies of companies. Analyzing these short texts can help companies discover what topics users are complaining about. It is critical to locate and respond to these complaints timely so that companies can improve users' satisfaction and loyalty. As the data volume is large, topic model can help discover key complaint topics quickly. The complaint data are in the form of short texts and streams, traditional topic models like LDA and BTM are not suitable in this scenario, for the reason that LDA is designed for long texts and BTM can not handle streams. This paper firstly proposes an improved shorttext topic model called PMITI-BTM to generate topics from user complaint data statically, and then further extends this algorithm into a dynamic one to suit the streaming feature of data. To further analyze the topic evolution, we finally propose a clustering algorithm called TDWAP to acquire the evolution process of these topics in different time slices. For each algorithm, we do several experiments to prove its efficiency. Results show that our methods not only can improve the performance of short texts topic discovery, but also can discover the evolution of topics.
机译:用户投诉数据非常有价值,因为它们可以反映公司的缺陷。分析这些简短的文本可以帮助公司发现用户抱怨的主题。及时找到并响应这些投诉至关重要,这样公司才能提高用户的满意度和忠诚度。由于数据量很大,主题模型可以帮助快速发现关键的投诉主题。投诉数据采用短文本和流的形式,传统的主题模型(如LDA和BTM)不适用于这种情况,因为LDA是为长文本设计的,而BTM无法处理流。本文首先提出了一种改进的短文本主题模型,称为PMITI-BTM,可以从用户投诉数据中静态生成主题,然后将该算法进一步扩展为动态模型,以适应数据的流传输特征。为了进一步分析主题演变,我们最终提出了一种称为TDWAP的聚类算法,以获取这些主题在不同时间段内的演变过程。对于每种算法,我们进行了几次实验以证明其效率。结果表明,我们的方法不仅可以提高短文本主题发现的性能,而且可以发现主题的发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号