首页> 中文期刊> 《计算机应用》 >基于聚类分析分库策略的社交网络数据库查询性能与数据迁移

基于聚类分析分库策略的社交网络数据库查询性能与数据迁移

         

摘要

Social network data has a certain degree of aggregation,namely the similar users are more prone to the same behavior.According to the conventional horizontal database shard method,a large amount of time and connection loss were consumed in order to access a plurality of databases in turn when performing the information query of these events.In order to solve this problem,the database shard strategy based on clustering analysis was proposed.Through clustering the characteristic scalars of social network subjects,the main body with the high aggregation was divided into one or as possible libraries to improve the query efficiency of the events,and to give consideration to load balancing,large data migration and other issues.The experimental results show that for the mainstream social networking events,the performance improvement of the proposed strategy is up to 23.4% at most,and local optimal load balance and zero data migration are realized.In general,the database shard strategy based on clustering analysis of social network,has a considerable advantage on improving query efficiency,balance load balancing and large data migration feasibility over the traditional conventional horizontal database shard method of cutting library.%社交网络数据具有一定的聚合性,即特征上相近的用户之间更容易产生某种行为.依照常规的水平切分方法,在执行这些事件的信息查询时,将会耗费大量的时间和连接损耗去依次访问多个数据库.针对此问题,提出了基于聚类分析的社交网络数据库分库策略.将社交网络主体的特征标量进行聚类,使得聚集程度高的主体尽量分割到一个或尽可能少的几个分库中去,从而提高事件的查询效率,并在此基础上兼顾负载均衡与大数据迁移等问题.实验结果表明,该策略在社交网络的主流事件查询上都表现出不同程度的性能提升,最高提升程度达到23.4%,并且实现了局部最优负载均衡和零数据迁移.总的来说,基于聚类分析的社交网络数据库分库策略在提高查询效率、平衡负载以及大数据迁移可行性上,比传统水平切割分库有了相当的优势.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号