首页> 外文会议>ICA3PP 2014 >GPU Acceleration of Finding Maximum Eigenvalue of Positive Matrices

【24h】

GPU Acceleration of Finding Maximum Eigenvalue of Positive Matrices

机译：GPU加速找到正矩阵的最大特征值

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Matrix eigenvalue theory has become an important analysis tool in scientific computing. Sometimes, people do not need to find all eigenvalues but only the maximum eigenvalue. Existing algorithms of finding the maximum eigenvalue of matrices are implemented sequentially. With the increasing of the orders of matrices, the workload of calculation is getting heavier. Therefore, traditional sequential methods are unable to meet the need of fast calculation for large matrices. This paper proposes a parallel algorithm named PA-ST to find the maximum eigenvalue of positive matrices by using similarity transformation which is implemented by CUDA (Computer Unified Device Architecture) on GPU (Graphic Process Unit). To the best of our knowledge, this is the first CUDA based parallel algorithm of calculating maximum eigenvalue of matrices. In order to improve the performance, optimization techniques are applied in this paper such as using the shared memory rather than the global memory to improve the speed of computation, avoiding bank conflicts by setting the span index, satisfying the principle of coalesced memory access, and by using single-precision floating-point arithmetic and the pinned memory to reduce the copy operation and obtain higher data transfer bandwidth between the host and the GPU device. The experimental results show that the similarity transformation technique can significantly shorten the running time compared to the sequential algorithm and the speedup ratio is nearly stable when the number of iterations increases. As the matrix order increases, the running time of the sequential algorithm and PA-ST increases correspondingly. Experiments also show that the speedup ratio of the PA-ST is between 2.85 and 35.028.

机译：矩阵特征值理论已成为科学计算中的重要分析工具。有时，人们不需要找到所有特征值，而是只有最大的特征值。依次实现找到矩阵最大特征值的现有算法。随着矩阵秩序的增加，计算的工作量正在变得越来越重。因此，传统的顺序方法无法满足大矩阵快速计算的需要。本文提出了一种名为PA-ST的并行算法，通过使用CUDA（计算机统一设备架构）在GPU（图形处理单元）实现的相似性转换来找到正矩阵的最大特征值。据我们所知，这是第一个基于CUDA的并行算法计算了矩阵最大特征值。为了提高性能，在本文中应用优化技术，例如使用共享内存而不是全局内存来提高计算速度，通过设置跨度指数来避免银行冲突，满足聚结的内存访问的原理，以及通过使用单精度浮点算术和固定存储器来减少复制操作并在主机和GPU设备之间获得更高的数据传输带宽。实验结果表明，与顺序算法相比，相似性变换技术可以显着缩短运行时间，并且当迭代的数量增加时，加速比几乎稳定。随着矩阵顺序增加，顺序算法和PA-ST的运行时间相应地增加。实验还表明，PA-ST的加速比在2.85和35.028之间。

著录项

来源
《ICA3PP 2014》|2014年||共14页
会议地点
作者
Ning Tian; Longjiang Guo; Chunyu Ai; Meirui Ren; Jinbao Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP302.1-532;
关键词
Maximum Eigenvalue; Positive Matrix; Similarity Transformation; GPU; CUDA;

机译：最大特征值;正矩阵;相似性转型;GPU;CUDA;
入库时间 2022-08-20 22:42:34

相似文献

外文文献
中文文献
专利

1. Energy-aware acceleration on GPUs: Findings on a bioinformatics benchmark [J] . Perez J., Rodriguez A., Chico J. F., Sustainable Computing . 2018,第DECa期

机译：GPU上的能源感知加速：生物信息学基准测试的发现
2. Some inequalities for eigenvalues and symplectic eigenvalues of positive definite matrices [J] . Bhatia Rajendra International journal of mathematics . 2019,第11期

机译：正定矩阵的特征值和辛意识的一些不等式
3. Indefinite copositive matrices with exactly one positive eigenvalue or exactly one negative eigenvalue [J] . Jargalsaikhan Bolor The Electronic Journal of Linear Algebra . 2013,第1期

机译：具有正一个正特征值或正一个负特征值的不定正定矩阵
4. GPU Acceleration of Finding Maximum Eigenvalue of Positive Matrices [C] . Ning Tian, Longjiang Guo, Chunyu Ai, International conference on algorithms and architectures for parallel processing . 2014

机译：查找正矩阵最大特征值的GPU加速
5. Finding a few eigenvalues of large sparse nonsymmetric matrices. [D] . Hagerty, Gary William. 2001

机译：查找大型稀疏非对称矩阵的一些特征值。
6. Computation of all eigenvalues of matrices used in restricted maximum likelihood estimation of variance components using sparse matrix techniques [O] . C Robert, V Ducrocq 1996

机译：使用稀疏矩阵技术计算方差分量的受限最大似然估计中使用的矩阵的所有特征值
7. Acceleration of Hessenberg Reduction for Nonsymmetric Eigenvalue Problems in a Hybrid CPU-GPU Computing Environment [O] . Jun-ichi Muramatsu, Takeshi Fukaya, Shao-liang Zhang, 2011

机译：混合CpU-GpU计算环境中非对称特征值问题的Hessenberg减少加速
8. Finding Eigenvalues and Eigenvectors of Unsymmetric Matrices Using a Distributed-Memory Multiprocessor [R] . Geist, G. A. , Davis, G. J. 1988

机译：利用分布式内存多处理器寻找非对称矩阵的特征值和特征向量

GPU Acceleration of Finding Maximum Eigenvalue of Positive Matrices

摘要

著录项

相似文献

相关主题

期刊订阅