首页> 外文会议>2015 Latin America Congress on Computational Intelligence >A study on topics identification on Twitter using clustering algorithms
【24h】

A study on topics identification on Twitter using clustering algorithms

机译:基于聚类算法的Twitter主题识别研究

获取原文
获取原文并翻译 | 示例

摘要

The identification of topics in Social Networks has become an important research task when dealing with event detection, particularly when global communities are affected. Text processing techniques and machine learning algorithms have been extensively used to solve this problem. In this paper we compare three clustering algorithms - k-means, k-medoids and NMF (Non-negative Matrix Factorization) - in order to detect topics related to textual messages obtained from Twitter. The algorithms were applied to a database composed by tweets, having as initial context hashtags that are related to the recent scandal of corruption involving FIFA (International Federation of Football Association). Obtained results suggest that the NMF presents better results, since it provides providing clusters that are easier to interpret.
机译:在处理事件检测时,尤其是在全球社区受到影响时,社交网络中主题的标识已成为一项重要的研究任务。文本处理技术和机器学习算法已广泛用于解决此问题。在本文中,我们比较了三种聚类算法-k-means,k-medoids和NMF(非负矩阵分解),以便检测与从Twitter获得的文本消息相关的主题。将该算法应用于由推文组成的数据库,该数据库具有与最近涉及FIFA(国际足联)的腐败丑闻有关的标签作为初始上下文。获得的结果表明,NMF提供了更好的结果,因为它提供了易于解释的聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号