Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm

机译：基于改进的K均值算法的微博热点发现方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The K-means algorithm is one of the most frequently used clustering algorithms in hot topic discovery. However, due to its shortcomings such as the number of clusters K value and easy to fall into local optimum, the clustering accuracy is not high, which directly affects the quality of hotspot discovery. This paper proposes an improved K-means algorithm to achieve fast clustering of microblog texts. Combining the high-frequency words and similarities of the microblog texts to perform single-pass clustering, the K number of clusters and the initial clustering center are obtained, which solves the problem that the K-means algorithm is too sensitive to the K value and the initial center. Through experimental comparison and analysis, it makes up for the shortcomings of K-means algorithm, and effectively improves the efficiency and accuracy of clustering. Applying it to the hot topic discovery model, the effectiveness of the hot spot discovery model based on the improved K-means algorithm is verified by experiments, and it has a high accuracy.

机译：K-means算法是热点话题发现中最常用的聚类算法之一。但是，由于聚类数K值多，容易陷入局部最优等缺点，聚类精度不高，直接影响热点发现的质量。本文提出了一种改进的K-means算法来实现微博客文本的快速聚类。结合高频词和微博文本的相似度进行单遍聚类，得到K个聚类和初始聚类中心，解决了K-means算法对K值过于敏感和最初的中心。通过实验比较分析，弥补了K-means算法的不足，有效提高了聚类的效率和准确性。将其应用于热点发现模型，通过实验验证了基于改进的K-means算法的热点发现模型的有效性，具有较高的准确性。

著录项

来源
《IEEE International Conference on High Performance Computing and Communications;IEEE International Conference on Smart City;IEEE International Conference on Data Science and Systems》|2019年|1220-1225|共6页
会议地点
作者
Qiang Gao; Jing Feng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Internet; Data models; Semantics; High frequency; Mathematical model; Data collection;

机译：聚类算法;互联网;数据模型;语义;高频;数学模型;数据收集;

相似文献

外文文献
中文文献
专利

1. Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means [J] . Gensheng Wang Computational intelligence and neuroscience . 2013,第Null期

机译：基于改进的K均值的互联网舆论热点发现研究
2. A Hotspot Discovery Method Based on Improved FIHC Clustering Algorithm [J] . Lin Lina, Wei Dezhi Technical Gazette . 2021,第5期

机译：一种基于改进FIHC聚类算法的热点发现方法
3. Topic Analysis of Microblog About "Didi Taxi" Based on K-means Algorithm [J] . Yonghe Lu, Xin Xiong American Journal of Information Science and Technology . 2019,第3期

机译：基于K-means算法的“滴滴出租车”微博主题分析
4. Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm [C] . Qiang Gao, Jing Feng IEEE International Conference on High Performance Computing and Communications . 2019

机译：基于改进的K均值算法的MicroBlog热点发现方法
5. Content-Based Earth Observation Data Discovery Methods Based on Intelligent Algorithms [D] . Cui, Kejin. 2020

机译：基于智能算法的基于内容的地球观测数据发现方法
6. Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means [O] . Gensheng Wang 2013

机译：基于改进的K均值的互联网舆论热点发现研究
7. A Rapid and High-Precision Mountain Vertex Extraction Method Based on Hotspot Analysis Clustering and Improved Eight-Connected Extraction Algorithms for Digital Elevation Models [O] . Zhenqi Zheng, Xiongwu Xiao, Zhi-Chao Zhong, 2020

机译：一种基于热点分析聚类的快速和高精度的山顶顶点提取方法，并改进了数字高度模型的八连接提取算法

Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅