首页> 外国专利> SPARSE AND DATA-PARALLEL INFERENCE METHOD AND SYSTEM FOR THE LATENT DIRICHLET ALLOCATION MODEL

SPARSE AND DATA-PARALLEL INFERENCE METHOD AND SYSTEM FOR THE LATENT DIRICHLET ALLOCATION MODEL

机译:分布式狄利克雷分配模型的稀疏和数据并行推理方法及系统

摘要

Herein is described a data-parallel and sparse algorithm for topic modeling. This algorithm is based on a highly parallel algorithm for a Greedy Gibbs sampler. The Greedy Gibbs sampler is a Markov-Chain Monte Carlo algorithm that estimates topics, in an unsupervised fashion, by estimating the parameters of the topic model Latent Dirichlet Allocation (LDA). The Greedy Gibbs sampler is a data-parallel algorithm for topic modeling, and is configured to be implemented on a highly-parallel architecture, such as a GPU. The Greedy Gibbs sampler is modified to take advantage of data sparsity while maintaining the parallelism. Furthermore, in an embodiment, implementation of the Greedy Gibbs sampler uses both densely-represented and sparsely-represented matrices to reduce the amount of computation while maintaining fast accesses to memory for implementation on a GPU.
机译:这里描述了用于主题建模的数据并行和稀疏算法。该算法基于Greedy Gibbs采样器的高度并行算法。 Greedy Gibbs采样器是一种Markov-Chain蒙特卡洛算法,它通过估计主题模型潜在Dirichlet分配(LDA)的参数,以无监督的方式估计主题。 Greedy Gibbs采样器是用于主题建模的数据并行算法,并配置为在高度并行的架构(例如GPU)上实现。对Greedy Gibbs采样器进行了修改,以在保持并行性的同时利用数据稀疏性。此外,在一个实施例中,贪婪吉布斯采样器的实现使用密集表示的矩阵和稀疏表示的矩阵两者来减少计算量,同时保持对存储器的快速访问以在GPU上实现。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号