IEEE Transactions on Parallel and Distributed Systems

Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters

Abstract

We describe in this paper an implementation of the Barnes-Hut algorithm on multicore clusters. Based on a partitioned global address space (PGAS) library, the design integrates intranode multithreading and internode one-sided communication, exemplifying a PGAS+X programming style. Within a node, the computation is decomposed into tasks (subtasks) and multitasking is used to hide network latency. We study the tradeoffs between locality in private caches and locality in shared caches and bring the insights into the design. As a result, our implementation consumes less memory per core, invokes less internode communication, and enjoys better load-balancing strategies. The final code achieves up to 41 percent performance improvement over a non-multithreaded counterpart. Through detailed comparison, we also show its advantages over other well-known Barnes-Hut implementations, both in programming complexity and in performance.
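
For readers unfamiliar with the underlying method, the following is a minimal sequential sketch of the Barnes-Hut force calculation in two dimensions. It is not the paper's multithreaded PGAS design, which the abstract does not spell out. All names (`Cell`, `insert`, `build_tree`, `accel`, `direct_accel`), the opening parameter `theta`, the softening length `eps`, and the choice of G = 1 are assumptions made for illustration. The sketch also assumes positive masses and distinct body positions inside the unit square.

```python
class Cell:
    """Square quadtree cell with center (cx, cy) and half side length `half`."""
    def __init__(self, cx, cy, half):
        self.cx, self.cy, self.half = cx, cy, half
        self.mass = 0.0            # total mass of the bodies in this cell
        self.mx = self.my = 0.0    # mass-weighted position sums
        self.body = None           # (x, y, m) while the cell holds exactly one body
        self.children = None       # list of 4 sub-cells once subdivided

def _push_down(cell, x, y, m):
    """Route a body into the quadrant of `cell` that contains it."""
    right, upper = x >= cell.cx, y >= cell.cy
    q = int(right) + 2 * int(upper)
    if cell.children[q] is None:
        h = cell.half / 2.0
        cell.children[q] = Cell(cell.cx + (h if right else -h),
                                cell.cy + (h if upper else -h), h)
    insert(cell.children[q], x, y, m)

def insert(cell, x, y, m):
    """Insert one body (assumes distinct positions, otherwise recursion never ends)."""
    if cell.children is None and cell.body is None:
        cell.body = (x, y, m)                  # empty leaf: store the body
    else:
        if cell.children is None:              # occupied leaf: subdivide first
            cell.children = [None] * 4
            old, cell.body = cell.body, None
            _push_down(cell, *old)
        _push_down(cell, x, y, m)
    cell.mass += m
    cell.mx += m * x
    cell.my += m * y

def build_tree(bodies):
    """Build a quadtree over the unit square from (x, y, m) tuples."""
    root = Cell(0.5, 0.5, 0.5)
    for x, y, m in bodies:
        insert(root, x, y, m)
    return root

def accel(cell, x, y, theta=0.5, eps=1e-3):
    """Softened acceleration at (x, y), with G = 1.

    A cell of side s whose center of mass lies at distance d is treated
    as a single point mass when s / d < theta; otherwise it is opened.
    """
    if cell is None:
        return 0.0, 0.0
    if cell.children is None:                  # leaf: use the body itself
        px, py = cell.body[0], cell.body[1]
    else:                                      # internal: use the center of mass
        px, py = cell.mx / cell.mass, cell.my / cell.mass
    dx, dy = px - x, py - y
    d2 = dx * dx + dy * dy
    size = 2.0 * cell.half
    if cell.children is None or size * size < theta * theta * d2:
        k = cell.mass / (d2 + eps * eps) ** 1.5
        return k * dx, k * dy                  # zero for a body acting on itself
    ax = ay = 0.0
    for child in cell.children:
        cax, cay = accel(child, x, y, theta, eps)
        ax += cax
        ay += cay
    return ax, ay

def direct_accel(bodies, x, y, eps=1e-3):
    """Reference O(N) direct sum with the same softening."""
    ax = ay = 0.0
    for bx, by, m in bodies:
        dx, dy = bx - x, by - y
        k = m / (dx * dx + dy * dy + eps * eps) ** 1.5
        ax += k * dx
        ay += k * dy
    return ax, ay
```

Each call to `accel` is an independent read-only traversal of the tree, which is one plausible unit of work to group into tasks. How the paper actually forms its tasks and subtasks is not stated in the abstract. Setting `theta=0` forces every cell to be opened, so the result should agree with `direct_accel` up to floating-point rounding.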
