首页> 外国专利> PARALLELIZATION OF NODE'S FAULT TOLERENT RECORD LINKAGE USING SMART INDEXING AND HIERARCHICAL CLUSTERING

PARALLELIZATION OF NODE'S FAULT TOLERENT RECORD LINKAGE USING SMART INDEXING AND HIERARCHICAL CLUSTERING

机译:利用智能索引和层次聚类法对Nodes容错记录链接进行并行化

摘要

Embodiments include a computer-implemented method including identifying, by a primary computer device, a plurality of records, each record having one or more attributes; standardizing, by the primary computer device, each of the plurality of records; assigning, by the primary computer device, an index to one or more of the one or more attributes; providing, by the primary computer device, instructions for clustering the standardized plurality of records in parallel into one or more clusters, each cluster including records having the same index, the one or more clusters being in a group; receiving, by the primary computer device, one or more groups, each group including one or more clusters sharing a same index; and linking one or more of the plurality of records in a cluster with another one or more of the plurality of records in another cluster within a same group.
机译:实施例包括一种计算机实现的方法,该方法包括通过主计算机设备识别多个记录,每个记录具有一个或多个属性;由主计算机设备标准化多个记录中的每一个;所述主计算机设备为所述一个或多个属性中的一个或多个分配索引;由主计算机设备提供用于将标准化的多个记录并行地群集到一个或多个群集中的指令,每个群集包括具有相同索引的记录,一个或多个群集在一组中;所述主计算机设备接收一个或多个组,每个组包括一个或多个共享相同索引的集群;并将群集中的多个记录中的一个或多个与同一组中另一个群集中的多个记录中的另一个或多个链接。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号