首页> 外文会议>International Conference on Computational Science >A High-Performance Implementation of Bayesian Matrix Factorization with Limited Communication
【24h】

A High-Performance Implementation of Bayesian Matrix Factorization with Limited Communication

机译:通信受限的贝叶斯矩阵分解的高性能实现

获取原文

摘要

Matrix factorization is a very common machine learning technique in recommender systems. Bayesian Matrix Factorization (BMF) algorithms would be attractive because of their ability to quantify uncertainty in their predictions and avoid over-fitting, combined with high prediction accuracy. However, they have not been widely used on large-scale data because of their prohibitive computational cost. In recent work, efforts have been made to reduce the cost, both by improving the scalability of the BMF algorithm as well as its implementation, but so far mainly separately. In this paper we show that the state-of-the-art of both approaches to scalability can be combined. We combine the recent highly-scalable Posterior Propagation algorithm for BMF, which parallelizes computation of blocks of the matrix, with a distributed BMF implementation that users asynchronous communication within each block. We show that the combination of the two methods gives substantial improvements in the scalability of BMF on web-scale datasets, when the goal is to reduce the wall-clock time.
机译:矩阵分解是推荐系统中非常普通的机器学习技术。贝叶斯矩阵分解(BMF)算法将具有吸引力,因为它们能够量化其预测中的不确定性并避免过度拟合,以及高预测精度。然而,由于其禁止的计算成本,它们未被广泛使用大规模数据。在最近的工作中,已经通过提高BMF算法的可扩展性以及其实现来降低成本,以降低成本,但到目前为止主要是单独的。在本文中,我们表明,可以组合到可扩展性的最先进的方法。我们将最近的BMF的高度可扩展的后传播算法结合起来,其并行化矩阵块的计算,具有分布式BMF实现,其用户在每个块内的异步通信。我们表明两种方法的组合在目标是减少壁钟时间时,这两种方法的组合在网级数据集上的BMF的可扩展性进行了大量的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号