首页> 外文会议>2018 IEEE Indian Conference on Antennas and Propogation >In-Core LU-Decomposition of Symmetrical Dense MoM Matrix in WIPL-D Multi-GPU Solver
【24h】

In-Core LU-Decomposition of Symmetrical Dense MoM Matrix in WIPL-D Multi-GPU Solver

机译:WIPL-D多GPU解算器中对称密集MoM矩阵的核内LU分解

获取原文
获取原文并翻译 | 示例

摘要

Acceleration of in-core LU decomposition of symmetrical dense MoM matrices by using multiple GPUs in parallel is presented in this paper. Memory limitations of GPUs are overcome by using block LU decomposition, where the entire system matrix is stored in CPU RAM, while only processed blocks are stored in GPU VRAM. The presented algorithm for LU decomposition enables highly efficient utilization of an arbitrary number of GPUs. Comparison of performance of up to 4 GTX 680 GPUs and up to 4 GTX 1080 Ti GPUs is shown. Presented results show that a symmetrical dense MoM matrix with 100 000 complex unknowns in single precision can be LU decomposed in about 3.5 minutes, on a personal computer equipped with 4 GTX 1080 Ti GPUs.
机译:本文提出了通过并行使用多个GPU来加速对称密集MoM矩阵的核内LU分解的过程。通过使用块LU分解克服了GPU的内存限制,其中整个系统矩阵存储在CPU RAM中,而只有经过处理的块存储在GPU VRAM中。所提出的用于LU分解的算法能够高效利用任意数量的GPU。显示了多达4个GTX 680 GPU和多达4个GTX 1080 Ti GPU的性能比较。呈现的结果表明,在配备有4个GTX 1080 Ti GPU的个人计算机上,可以在大约3.5分钟的时间内对具有单精度10万个复杂未知数的对称密集MoM矩阵进行LU分解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号