A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs

机译：Nvidia GPU中的单设备和多设备同步方法的研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

GPUs are playing an increasingly important role in general-purpose computing. Many algorithms require synchronizations at different levels of granularity in a single GPU. Additionally, the emergence of dense GPU nodes also calls for multi-GPU synchronization. Nvidia’s latest CUDA provides a variety of synchronization methods. Until now, there is no full understanding of the characteristics of those synchronization methods. This work explores important undocumented features and provides an in-depth analysis of the performance considerations and pitfalls of the state-of-art synchronization methods for Nvidia GPUs. The provided analysis would be useful when making design choices for applications, libraries, and frameworks running on single and/or multi-GPU environments. We provide a case study of the commonly used reduction operator to illustrate how the knowledge gained in our analysis can be useful. We also describe our micro-benchmarks and measurement methods.

机译：GPU在通用计算中扮演着越来越重要的角色。许多算法需要在单个GPU中以不同的粒度级别进行同步。此外，密集GPU节点的出现也要求进行多GPU同步。 Nvidia的最新CUDA提供了多种同步方法。到目前为止，还没有完全了解这些同步方法的特征。这项工作探索了重要的未记录功能，并对Nvidia GPU的最新同步方法的性能注意事项和陷阱进行了深入分析。当为在单GPU和/或多GPU环境中运行的应用程序，库和框架做出设计选择时，提供的分析将很有用。我们提供了一个常用还原算子的案例研究，以说明在我们的分析中获得的知识如何有用。我们还将描述我们的微基准和测量方法。

著录项

来源
《IEEE International Parallel and Distributed Processing Symposium》|2020年|483-493|共11页
会议地点
作者
Lingqi Zhang; Mohamed Wahib; Haoyu Zhang; Satoshi Matsuoka;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
CUDA Barrier; Synchronization; GPUs;

机译：CUDA屏障;同步; GPU;

相似文献

外文文献
中文文献
专利

1. Optimizing Finite Volume Method Solvers on Nvidia GPUs [J] . Xu Jingheng, Fu Haohuan, Luk Wayne, IEEE Transactions on Parallel and Distributed Systems . 2019,第12期

机译：在Nvidia GPU上优化有限体积方法求解器
2. HbbTV-Compliant Platform for Hybrid Media Delivery and Synchronization on Single- and Multi-Device Scenarios [J] . Fernando Boronat, Dani Marfil, Mario Montagud, Broadcasting, IEEE Transactions on . 2018,第3期

机译：兼容HbbTV的平台，可在单设备和多设备方案中进行混合媒体交付和同步
3. Collision detection of convex polyhedra on the NVIDIA GPU architecture for the discrete element method [J] . Govender Nicolin, Wilke Daniel N., Kok Schalk Applied mathematics and computation . 2015,第Null期

机译：离散元素方法在NVIDIA GPU架构上的凸多面体碰撞检测
4. GPU-Centric Communication on NVIDIA GPU Clusters with InfiniBand: A Case Study with OpenSHMEM [C] . Sreeram Potluri, Anshuman Goswami, Davide Rossetti, 2017 IEEE 24th International Conference on High Performance Computing . 2017

机译：带有InfiniBand的NVIDIA GPU群集上以GPU为中心的通信：以OpenSHMEM为例
5. Method of Moments Modeling of Single Layer Microstrip Patch Antennas using GPU Acceleration and Quasi-Monte Carlo Integration. [D] . Cerjanic, Alexander M. 2012

机译：使用GPU加速和拟蒙特卡洛积分的单层微带贴片天线矩建模方法。
6. What Influences Adolescent Girls’ Decision-Making Regarding Contraceptive Methods Use and Childbearing? A Qualitative Exploratory Study in Rangpur District Bangladesh [O] . A. S. M. Shahabuddin, Christiana Nöstlinger, Thérèse Delvaux, -1

机译：是什么影响少女在避孕方法的使用和生育方面的决策？孟加拉邦布尔区的定性探索性研究
7. A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs [O] . Lingqi Zhang, Mohamed Wahib, Haoyu Zhang, 2020

机译：NVIDIA GPU中的单无器件同步方法研究

A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs

摘要

著录项

相似文献

相关主题

期刊订阅