...
首页> 外文期刊>Journal of Computational Physics >A sparse octree gravitational N-body code that runs entirely on the GPU processor
【24h】

A sparse octree gravitational N-body code that runs entirely on the GPU processor

机译:完全在GPU处理器上运行的稀疏八叉树重力N体代码

获取原文
获取原文并翻译 | 示例
           

摘要

We present the implementation and performance of a new gravitational N-body tree-code that is specifically designed for the graphics processing unit (GPU).1The code is publicly available at: http://castle.strw.leidenuniv.nl/software.html. ~1 All parts of the tree-code algorithm are executed on the GPU. We present algorithms for parallel construction and traversing of sparse octrees. These algorithms are implemented in CUDA and tested on NVIDIA GPUs, but they are portable to OpenCL and can easily be used on many-core devices from other manufacturers. This portability is achieved by using general parallel-scan and sort methods. The gravitational tree-code outperforms tuned CPU code during the tree-construction and shows a performance improvement of more than a factor 20 overall, resulting in a processing rate of more than 2.8 million particles per second.
机译:我们介绍了专门为图形处理单元(GPU)设计的一种新的重力N体树代码的实现和性能。1该代码可从以下网站公开获得:http://castle.strw.leidenuniv.nl/software。 html。 〜1树代码算法的所有部分都在GPU上执行。我们提出了稀疏八叉树的并行构造和遍历的算法。这些算法在CUDA中实现并在NVIDIA GPU上进行了测试,但是它们可移植到OpenCL,并且可以轻松地用于其他制造商的许多核心设备上。通过使用常规的并行扫描和排序方法可以实现这种可移植性。重力树代码在树构建过程中的性能优于调整后的CPU代码,并且总体上将性能提高20倍以上,从而导致每秒处理的速度超过280万个粒子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号