首页> 外文学位 >Run-time support and compilation methods for irregular computations on distributed memory parallel machines.
【24h】

Run-time support and compilation methods for irregular computations on distributed memory parallel machines.

机译:分布式内存并行机上不规则计算的运行时支持和编译方法。

获取原文
获取原文并翻译 | 示例

摘要

In recent years, distributed memory parallel machines have been widely recognized as the most likely means of achieving teraflops performance. However, programming a distributed memory machine to get good speed-ups and efficiency proves to be cumbersome. To ease the task of programming parallel machines, recently there have been major efforts in developing programming language and compiler support for distributed memory machines.; There exists a class of scientific and engineering applications, called irregular applications, in which many of the optimizations can be done only at runtime. This constraint presents a greater challenge for compilers. This research provides solutions for compiling irregular problems. This thesis presents a combined runtime and compile-time approach for parallelizing this general class of applications on distributed memory machines. It presents a runtime system that has been designed and implemented for parallelizing these applications on distributed memory machines. Methods by which compilers for High Performance Fortran (HPF) style parallel programming languages can automatically generate calls to the runtime system are also presented.; The runtime system supports the partitioning of loop iterations to maintain data locality, the coupling of data partitioners to obtain non-standard distribution, the remapping of data structures and optimizations such as vectorization, aggregation and schedule reuse. The compiler techniques have been implemented in the Fortran 90D/HPF compiler being developed at Syracuse University. The runtime and compile-time approaches have been evaluated using templates from real scientific applications. Performance results of Fortran 90D compiler-parallelized codes are compared with that of hand-parallelized codes. It is observed that the compiler-generated codes perform within 15% of the hand-parallelized codes.
机译:近年来,分布式存储器并行机已被广泛认为是实现万亿次浮点运算性能的最可能手段。但是,对分布式存储机器进行编程以获得良好的速度和效率被证明很麻烦。为了减轻对并行机进行编程的任务,最近在开发用于分布式存储机的编程语言和编译器支持方面做出了巨大的努力。存在一类称为不规则应用程序的科学和工程应用程序,其中许多优化只能在运行时进行。该约束对编译器提出了更大的挑战。这项研究为解决不规则问题提供了解决方案。本文提出了一种组合的运行时和编译时方法,用于并行化分布式存储机器上的此类通用应用程序。它提供了一个运行时系统,该系统已设计和实现为并行化分布式存储计算机上的这些应用程序。还介绍了用于高性能Fortran(HPF)样式的并行编程语言的编译器可以自动生成对运行时系统的调用的方法。运行时系统支持循环迭代的分区以维护数据局部性,数据分区器的耦合以获得非标准的分布,数据结构的重新映射以及诸如矢量化,聚合和调度重用之类的优化。编译器技术已在Syracuse University开发的Fortran 90D / HPF编译器中实现。使用来自实际科学应用程序的模板对运行时和编译时方法进行了评估。将Fortran 90D编译器并行化代码的性能结果与手工并行化代码的性能结果进行比较。可以观察到,编译器生成的代码在手动并行代码的15%内执行。

著录项

  • 作者

    Ponnusamy, Ravi.;

  • 作者单位

    Syracuse University.;

  • 授予单位 Syracuse University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1995
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号