...
首页> 外文期刊>International Journal for Numerical Methods in Fluids >Hybrid spectral finite difference simulations of stratified turbulent flows on distributed memory architectures
【24h】

Hybrid spectral finite difference simulations of stratified turbulent flows on distributed memory architectures

机译:分布式存储架构上分层湍流的混合谱有限差分模拟

获取原文
获取原文并翻译 | 示例
           

摘要

A method for efficient implementation of a combined spectral finite difference algorithm for computation of incompressible stratified turbulent flows on distributed memory computers is presented. The solution technique is the fractional step method with a semi-implicit time advancement scheme. A single-programme multiple-data abstraction is used in conjunction with a static data-partitioning scheme. The distributed FFTs required in the explicit step are based on the transpose method and the large sets of independent tridiagonal systems of equations arising in the implicit steps are solved using the pipelined Thomas algorithm. A speed-up analysis of a model problem is presented for three partitioning schemes, namely unipartition, multipartition and transpose partition. It is shown that the unipartitioning scheme is best suited for this algorithm. Performance measurements of the overall as well as individual stages of the algorithm are presented for several different grids and are discussed in the context of associated dependency and communication overheads. An unscaled speed-up efficiency of up to 91% on doubling the number of processors and up to 60% on an eightfold increase in the number of processors was obtained on the Intel Paragon and iPSC/860 Hypercube. Absolute performance of the code was evaluated by comparisons with performance on the Cray-YMP. On 128 Paragon processors, performance up to five times that of a single-processor Cray-YMP was obtained. The validation of the method and results of grid refinement studies in stably stratified turbulent channel flows are presented.
机译:提出了一种有效实现组合频谱有限差分算法的方法,用于在分布式存储计算机上计算不可压缩的分层湍流。解决方法是采用半隐式时间提前方案的分数步法。单程序多数据抽象与静态数据分区方案结合使用。显式步骤中需要的分布式FFT基于转置方法,隐式步骤中出现的大量独立的三对角方程组使用流水线Thomas算法进行求解。针对单分区,多分区和转置分区三种分区方案,对模型问题进行了加速分析。结果表明,单分区方案最适合该算法。针对几个不同的网格介绍了算法的整体以及各个阶段的性能,并在相关的依赖性和通信开销的背景下进行了讨论。在Intel Paragon和iPSC / 860 Hypercube上获得的无比例加速效率在处理器数量增加一倍时达到91%,在处理器数量增加八倍时达到60%。通过与Cray-YMP上的性能进行比较来评估代码的绝对性能。在128个Paragon处理器上,性能达到单处理器Cray-YMP的五倍。提出了方法的验证以及在稳定分层湍流通道中网格细化研究的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号