In this work, we use the OpenCL framework to accelerate an EMRI (extreme mass-ratio inspiral) modeling application on two hardware accelerators: the Cell BE and the Tesla CUDA GPU. We describe these compute technologies and our parallelization approach in detail, present our performance results, and compare them with those from our previous implementations based on the native CUDA and Cell SDKs. The OpenCL framework allows us to execute identical source code on both architectures while still obtaining strong performance gains, comparable to those achievable with the native SDKs.