Segmented Operations for Sparse Matrix Computation on Vector Multiprocessors.

Abstract

In this paper we present a new technique for sparse matrix multiplication on vector multiprocessors based on the efficient implementation of a segmented sum operation. We describe how the segmented sum can be implemented on vector multiprocessors such that it both fully vectorizes within each processor and parallelizes across processors. Because of our method's insensitivity to relative row size, it is better suited than the Ellpack/Itpack or the Jagged Diagonal algorithms for matrices that have a varying number of non-zero elements in each row. Furthermore, our approach requires less preprocessing (no more time than a single sparse matrix-vector multiplication), less auxiliary storage, and uses a more convenient data representation (an augmented form of the standard compressed sparse row format). We have implemented our algorithm (SEGMV) on the Cray Y-MP C90, and have compared its performance with other methods on a variety of sparse matrices from the Harwell-Boeing collection and industrial application codes. Our performance on the test matrices is up to 3 times faster than the Jagged Diagonal algorithm and up to 5 times faster than the Ellpack/Itpack method. Our preprocessing time is an order of magnitude faster than for the Jagged Diagonal algorithm. Also, using an assembly language implementation of SEGMV on a 16-processor C90, the NAS Conjugate Gradient benchmark runs at 3.5 gigaflops.
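
The idea of computing a sparse matrix-vector product through a segmented sum can be illustrated with a minimal sequential sketch. This is not the authors' SEGMV code, which is vectorized and parallelized for the Cray C90 and uses an augmented compressed sparse row format not detailed in the abstract. The function names `segmented_sum` and `csr_spmv_segmented`, and the plain CSR arrays `row_ptr`, `col_idx`, and `vals`, are assumptions chosen for illustration.

```python
def segmented_sum(values, flags):
    """Sum each segment of `values`; flags[i] is True where a segment starts."""
    sums = []
    for v, starts_segment in zip(values, flags):
        if starts_segment:
            sums.append(v)       # open a new segment
        else:
            sums[-1] += v        # accumulate into the current segment
    return sums


def csr_spmv_segmented(row_ptr, col_idx, vals, x):
    """Compute y = A @ x for a CSR matrix A using products plus a segmented sum."""
    n_rows = len(row_ptr) - 1
    nnz = len(vals)
    # One product per stored nonzero; this step does not depend on row lengths.
    products = [vals[k] * x[col_idx[k]] for k in range(nnz)]
    # Mark the first nonzero of every non-empty row as a segment start.
    flags = [False] * nnz
    nonempty_rows = []
    for i in range(n_rows):
        if row_ptr[i] < row_ptr[i + 1]:
            flags[row_ptr[i]] = True
            nonempty_rows.append(i)
    sums = segmented_sum(products, flags)
    # Scatter segment sums back to their rows; empty rows stay zero.
    y = [0.0] * n_rows
    for i, s in zip(nonempty_rows, sums):
        y[i] = s
    return y


# A = [[1, 0, 2], [0, 0, 0], [0, 3, 0], [4, 5, 6]] with rows of uneven length.
row_ptr = [0, 2, 2, 3, 6]
col_idx = [0, 2, 1, 0, 1, 2]
vals = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
print(csr_spmv_segmented(row_ptr, col_idx, vals, [1.0, 2.0, 3.0]))
# prints [7.0, 0.0, 6.0, 32.0]
```

In this sketch the work is organized over the flat list of nonzeros, not over rows, which is one way to see why the approach is insensitive to how unevenly the nonzeros are distributed among rows.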

Bibliographic details

Authors
Blelloch, G. E.; Heroux, M. A.; Zagha, M.
Year: 1993
Pages: 1-34
Total pages: 34
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Multiprocessors; Sparse matrix; Algorithms; Assembly languages; Formats; Gradients; Multiplication; Preprocessing; Standards; Test and evaluation; Linear systems; Iterations; Kernel functions; Computation

Similar literature

1. A sparse matrix-vector multiplication based algorithm for accurate density matrix computations on systems of millions of atoms [J]. Ghale Purnima, Johnson Harley T. Computer physics communications. 2018.
2. GPU accelerated sparse matrix-vector multiplication and sparse matrix-transpose vector multiplication [J]. Yuan Tao, Yangdong Deng, Shuai Mu. Concurrency and computation: practice and experience. 2015, No. 14.
3. Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors [J]. H. Sips. Computing reviews. 2016, No. 7.
4. Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks [C]. Aydin Buluc, Jeremy T. Fineman, Matteo Frigo. 21st annual symposium on parallelism in algorithms and architectures 2009. 2009.
5. Fast space-varying convolution in stray light reduction, fast matrix vector multiplication using the sparse matrix transform, and activation detection in fMRI data analysis [D]. Wei, Jianing. 2010.
6. Efficient Computation of the Latent Vectors of a Matrix [O]. Paul A. Samuelson. 1943.
7. Sparse matrix algorithms on distributed memory multiprocessors. Progress report, June 1991--January 1992 [O]. A. Pothen. 1992.
