Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

Kadir Akbudak; Cevdet Aykanat

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

【24h】

Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

机译：在多核架构上利用稀疏矩阵-矩阵乘法的局部性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Exploiting spatial and temporal localities is investigated for efficient row-by-row parallelization of general sparse matrix-matrix multiplication (SpGEMM) operation of the form C=AB on many-core architectures. Hypergraph and bipartite graph models are proposed for 1D rowwise partitioning of matrix A to evenly partition the work across threads with the objective of reducing the number of B -matrix words to be transferred from the memory and between different caches. A hypergraph model is proposed for B -matrix column reordering to exploit spatial locality in accessing entries of thread-private temporary arrays, which are used to accumulate results for C -matrix rows. A similarity graph model is proposed for B -matrix row reordering to increase temporal reuse of these accumulation array entries. The proposed models and methods are tested on a wide range of sparse matrices from real applications and the experiments were carried on a 60-core Intel Xeon Phi processor, as well as a two-socket Xeon processor. Results show the validity of the models and methods proposed for enhancing the locality in parallel SpGEMM operations.

机译：研究了利用空间和时间局部性来有效地对多核体系结构上的C = AB形式的通用稀疏矩阵-矩阵乘法（SpGEMM）操作进行逐行并行化。提出了超图和二部图模型用于矩阵A的一维行分区，以跨线程均匀地划分工作，目的是减少要从内存和不同缓存之间传输的B矩阵字的数量。提出了一种用于B矩阵列重排的超图模型，以利用空间局部性来访问线程专用临时数组的条目，这些线程用于累积C矩阵行的结果。提出了一种相似图模型用于B矩阵行重排序，以增加这些累积数组项的时间重用。所提出的模型和方法在实际应用中的各种稀疏矩阵上进行了测试，并且实验在60核Intel Xeon Phi处理器以及两路Xeon处理器上进行。结果表明，提出的用于在并行SpGEMM运算中增强局部性的模型和方法的有效性。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2017年第8期|2258-2271|共14页
作者
Kadir Akbudak; Cevdet Aykanat;
展开▼
作者单位

Computer Engineering Department, Bilkent University, Ankara, Turkey;

Computer Engineering Department, Bilkent University, Ankara, Turkey;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Sparse matrices; Computer architecture; Instruction sets; Computational modeling; Bipartite graph; Parallel processing; Data models;

机译：稀疏矩阵;计算机体系结构;指令集;计算建模;二部图;并行处理;数据模型;

相似文献

外文文献
中文文献
专利

1. Multithreaded sparse matrix-matrix multiplication for many-core and GPU architectures [J] . Deveci Mehmet, Trott Christian, Rajamanickam Sivasankaran Parallel Computing . 2018,第octa期

机译：适用于多核和GPU架构的多线程稀疏矩阵矩阵乘法
2. MEMORY-EFFICIENT SPARSE MATRIX-MATRIX MULTIPLICATION BY ROW MERGING ON MANY-CORE ARCHITECTURES [J] . Gremse Felix, Kuepper Kerstin, Naumann Uwe SIAM Journal on Scientific Computing . 2018,第4期

机译：在许多核心架构上的行合并的内存高效的稀疏矩阵乘法
3. Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model [J] . Rubensson Emanuel H., Rudberg Elias Parallel Computing . 2016,第SEPa期

机译：使用块和任务编程模型的局部性并行块稀疏矩阵矩阵乘法
4. Performance-Portable Sparse Matrix-Matrix Multiplication for Many-Core Architectures [C] . Mehmet Deveci, Christian Trott, Sivasankaran Rajamanickam IEEE International Parallel and Distributed Processing Symposium Workshops . 2017

机译：用于多核架构的性能便携式稀疏矩阵矩阵乘法
5. Sparse Matrix Multiplication on a Many-core Platform [D] . Shi, Peiyao. 2018

机译：许多核心平台上的稀疏矩阵乘法
6. High-Performance 3D Compressive Sensing MRI Reconstruction Using Many-Core Architectures [O] . Daehyun Kim, Joshua Trzasko, Mikhail Smelyanskiy, 2011

机译：使用多核架构的高性能3D压缩传感MRI重建
7. Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures [O] . Kadir Akbudak, Cevdet Aykanat 2017

机译：利用稀疏矩阵 - 矩阵乘法的临时函数在多核架构上

Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

摘要

著录项

相似文献

相关主题

期刊订阅