首页> 外文会议>International Workshop Innovative Architecture for Future Generation High-Performance Processors and Systems >Selective guarded execution using profiling on a dynamically scheduled processor
【24h】

Selective guarded execution using profiling on a dynamically scheduled processor

机译:使用动态预定的处理器上的可选择保护执行执行

获取原文

摘要

Modern dynamically scheduled processors use branch prediction hardware to speculatively fetch and execute most likely executed paths in a program. Complex branch predictors have been proposed which attempt to identify these paths accurately such that the hardware can benefit from out-of-order (OOO) execution. Recent studies have shown that inspite of such complex prediction schemes, there still exist many frequently executed branches which are difficult to predict. Predicated execution has been proposed as an alternative technique to eliminate some of these branches in various forms ranging from a restrictive support to a full-blown support. We call the restrictive form of predicated execution as guarded execution. In this paper, we propose a new algorithm which uses profiling and selectively performs if-conversion for architectures with guarded execution support. Branch profiling is used to gather the taken, non-taken and misprediction counts for every branch. This combined with block profiling is used to select paths which suffer from heavy mispredictions and are profitable to if-convert. Effects of three different selection criterias, namely size-based, predictability-based and profiled-based on net cycle improvements, branch mispredictions and mis-speculated instructions are then studied. We also explain numerous adjustments that were made to the selection criterias to better reflect the OOO processor behaviour.
机译:现代动态调度处理器使用分支预测硬件来推测和执行程序中最有可能的执行路径。已经提出了复杂的分支预测因子,该分支预测因子试图准确地识别这些路径,使得硬件可以从oround(OOO)执行中受益。最近的研究表明,这种复杂预测方案的计算机仍然存在许多常常执行的分支,这难以预测。已经提出了预先执行的执行作为消除各种形式的一些这些分支的替代技术,从限制性支撑到全吹载体。我们称之为预测执行的限制性形式为守护的执行。在本文中,我们提出了一种新的算法,它使用分析,并选择性地对架构执行架构的if-constorment进行守护的执行支持。分支分析用于为每个分支机构收集,未采取的和误需的计数。这种与块分析相结合用于选择患有沉重错误的路径,并且对if-convers有利可图。研究了三种不同选择标准,即基于尺寸的,基于可预测和异教徒的基于净循环改进,分支错误预测和错误推测指令的影响。我们还解释了对选择标准进行的众多调整,以更好地反映OOO处理器行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号