
Compiler Directed Parallelization of Loops in Scale for Shared-Memory Multiprocessors



Abstract

Effective utilization of symmetric shared-memory multiprocessors (SMPs) is predicated on the development of efficient parallel code. Unfortunately, efficient parallelism is not always easy for the programmer to identify. Worse, exploiting such parallelism may directly conflict with optimizations affecting per-processor utilization (e.g., loop reordering to improve data locality). Here, we present our experience with a loop-level parallel compiler optimization for SMPs proposed by McKinley. The algorithm uses dependence analysis and a simple model of the target machine to transform nested loops. The goal of the approach is to promote efficient execution of parallel loops by exposing sources of large-grain parallel work while maintaining per-processor locality. We implement the optimization within the Scale compiler framework, and analyze the performance of the multiprocessor code produced for three microbenchmarks.
