Cacheminer: A runtime approach to exploit cache locality on SMP

Yong Yan; Xiaodong Zhang

Abstract

Exploiting the cache locality of parallel programs at runtime complements compiler optimization, and it is particularly important for applications with dynamic memory-access patterns. We propose a memory-layout-oriented technique to exploit the cache locality of parallel loops at runtime on symmetric multiprocessor (SMP) systems. Guided by application-dependent and target-architecture-dependent hints, our system, called Cacheminer, reorganizes and partitions a parallel loop using the memory-access space of its execution. Through effective runtime transformations, the system maximizes data reuse within each partitioned data region assigned to a cache and minimizes data sharing among the partitioned data regions assigned to all caches. The tasks in the partitions are scheduled in an adaptive, locality-preserving way that minimizes program execution time by trading off load balance against locality. We have implemented the Cacheminer runtime library on two commercial SMP servers and on a SimOS-simulated SMP. Our simulation and measurement results show that, for programs with regular computation and memory-access patterns, whose load balance and cache locality can be optimized well by tiling and other program transformations, our runtime approach achieves performance comparable to that of compiler optimizations. For applications with irregular computation and dynamic memory-access patterns, which are usually hard to optimize statically in the compiler, our experimental results show that the approach significantly improves memory performance.
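
The partition-then-schedule idea in the abstract can be illustrated with a small sketch. This is not the Cacheminer library or its API; the function names (`partition_by_region`, `next_chunk`), the one-address-per-iteration model, and the steal-from-the-fullest-queue policy are all assumptions made for illustration. Iterations that touch the same memory region are grouped into one chunk, neighbouring regions go to the same processor, and an idle processor takes work from another queue only when its own is empty.

```python
from collections import deque


def partition_by_region(accesses, region_size, num_procs):
    """Group loop iterations by the memory region they touch, then hand
    contiguous runs of regions to per-processor queues.

    accesses[i] is the (hypothetical) single address touched by iteration i.
    """
    regions = {}
    for iteration, address in enumerate(accesses):
        regions.setdefault(address // region_size, []).append(iteration)

    ordered = sorted(regions)                  # region ids in address order
    per_proc = -(-len(ordered) // num_procs)   # ceiling division
    queues = [deque() for _ in range(num_procs)]
    for rank, region in enumerate(ordered):
        # Neighbouring regions land on the same processor.
        queues[rank // per_proc].append(regions[region])
    return queues


def next_chunk(queues, proc):
    """Return the next chunk of iterations for `proc`.

    Local work comes first (locality); if the local queue is empty, take a
    chunk from the tail of the fullest queue (load balance). Returns None
    when no work is left anywhere.
    """
    if queues[proc]:
        return queues[proc].popleft()
    victim = max(range(len(queues)), key=lambda p: len(queues[p]))
    return queues[victim].pop() if queues[victim] else None
```

For example, with `accesses = [0, 9, 1, 8, 2, 17, 3, 16]`, `region_size = 4`, and two processors, the iterations fall into three regions; processor 0 receives the two lowest regions and processor 1 the third, and processor 1 takes a chunk from processor 0 once its own queue runs dry.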

Bibliographic record

Source
IEEE Transactions on Parallel and Distributed Systems | 2000, Issue 4 | pp. 357-374 | 18 pages
Authors
Yong Yan; Xiaodong Zhang
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
