首页> 外国专利> Decentralized Latent Semantic Index Using Distributed Average Consensus

Decentralized Latent Semantic Index Using Distributed Average Consensus

机译:使用分布式平均共识的分散潜在语义索引

摘要

A distributed computing device calculates word counts for each of a set of documents. The word counts are represented as values, each representing a number of times a corresponding word appears in one of the set of documents. The distributed computing device randomly samples the word counts to calculate sampled word counts. The distributed computing device and additional distributed computing devices iteratively execute a process to determine a consensus result for the sampled word counts based on the sampled word counts and additional sampled word counts calculated by the additional distributed computing devices. The distributed computing device determines a latent semantic index (LSI) subspace based on the consensus result for the sampled word count and reflecting contents of the set and additional sets of documents. The distributed computing device projects a document into the LSI subspace to determine the latent semantic content of the document.
机译:分布式计算设备计算一组文档中的每一个的字数。单词计数表示为值,每个值表示相应单词中的一组文档中的一个次数。分布式计算设备随机采样单词计数以计算采样字计数。分布式计算设备和附加的分布式计算设备迭代地执行用于基于所采样的字数和由附加分布式计算设备计算的其他采样字计数来确定采样字计数的共识结果的过程。分布式计算设备基于采样字计数的共识结果确定潜在语义索引(LSI)子空间,并反映集合的内容以及其他一组文档。分布式计算设备将文档投影到LSI子空间中以确定文档的潜在语义内容。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号