Efficient and Retargetable Dynamic Binary Translation on Multicores

Ding-Yong Hong; Jan-Jan Wu; Pen-Chung Yew; Wei-Chung Hsu; Chun-Chen Hsu; Pangfeng Liu; Chien-Min Wang; Yeh-Ching Chung

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Efficient and Retargetable Dynamic Binary Translation on Multicores

【24h】

Efficient and Retargetable Dynamic Binary Translation on Multicores

机译：多核上高效且可重定向的动态二进制翻译

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dynamic binary translation (DBT) is a core technologyto many important applications such as system virtualization, dynamic binary instrumentation, and security. However, there are several factors that often impede its performance: 1) emulation overhead before translation; 2) translation and optimization overhead; and 3) translated code quality. The issues also include its retargetabilitythat supports guest applications from different instruction-set architectures (ISAs) to host machines also with different ISAs-an important feature to system virtualization. In this work, we take advantage of the ubiquitous multicore platforms, and use a multithreaded approach to implement DBT. By running the translator and the dynamic binary optimizer on different cores with different threads, it could off-load the overhead incurred by DBT on the target applications; thus, afford DBT of more sophisticated optimization techniques as well as its retargetability. Using QEMU (a popular retargetable DBT for system virtualization) and Low-Level Virtual Machine (LLVM) as our building blocks, we demonstrated in a multithreaded DBT prototype, called Hybrid-QEMU (HQEMU), that it could improve QEMU performance by a factor of 2.6x and 4.1x on the SPEC CPU2006 integer and floating point benchmarks, respectively, for dynamic translation of x86 code to run on x86-64 platforms. For ARM codes to x86-64 platforms, HQEMU can gain a factor of 2.5x speedup over QEMU for the SPEC CPU2006 integer benchmarks. We also address the performance scalability issue of multithreaded applications across ISAs. We identify two major impediments to performance scalability in QEMU: 1) coarse-grained locks used to protect shared data structures, and 2) inefficient emulation of atomic instructions across ISAs. We proposed two techniques to mitigate those problems: 1) using indirect branch translation caching (IBTC) to avoid frequent accesses to locks, and 2) using lightweight memory transactions to emulate atomic instru- tions across ISAs. Our experimental results show that for multithread applications, HQEMU achieves 25X speedups over QEMU for the PARSEC benchmarks.

机译：动态二进制转换（DBT）是许多重要应用程序的核心技术，例如系统虚拟化，动态二进制检测和安全性。但是，有几个因素通常会阻碍其性能：1）翻译前的仿真开销； 2）翻译和优化开销；和3）翻译代码质量。问题还包括其可重新定向性，以支持来自不同指令集体系结构（ISA）的来宾应用程序到也具有不同ISA的主机的应用程序-这是系统虚拟化的重要功能。在这项工作中，我们利用了无处不在的多核平台，并使用多线程方法来实现DBT。通过在具有不同线程的不同内核上运行转换器和动态二进制优化器，可以减轻DBT在目标应用程序上产生的开销；因此，可以为DBT提供更复杂的优化技术及其可重定向性。使用QEMU（用于系统虚拟化的流行的可重定位DBT）和低级虚拟机（LLVM）作为我们的构建块，我们在称为Hybrid-QEMU（HQEMU）的多线程DBT原型中演示了它可以将QEMU性能提高一倍。分别针对SPEC CPU2006整数和浮点基准测试分别设置了2.6x和4.1x，以动态转换x86代码以在x86-64平台上运行。对于针对x86-64平台的ARM代码，对于SPEC CPU2006整数基准，HQEMU的速度是QEMU的2.5倍。我们还将解决跨ISA的多线程应用程序的性能可伸缩性问题。我们确定了QEMU中性能可伸缩性的两个主要障碍：1）用于保护共享数据结构的粗粒度锁； 2）跨ISA的原子指令的低效仿真。我们提出了两种缓解这些问题的技术：1）使用间接分支转换缓存（IBTC）以避免频繁访问锁，以及2）使用轻量级内存事务在ISA之间模拟原子指令。我们的实验结果表明，对于多线程应用程序，对于PARSEC基准，HQEMU的速度比QEMU快25倍。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2014年第3期|622-632|共11页
作者
Ding-Yong Hong; Jan-Jan Wu; Pen-Chung Yew; Wei-Chung Hsu; Chun-Chen Hsu; Pangfeng Liu; Chien-Min Wang; Yeh-Ching Chung;
展开▼
作者单位

Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Dynamic binary translation; feedback-directed optimization; hardware performance monitoring; multicores; traces;

机译：动态二进制翻译;反馈定向优化;硬件性能监控;多核;跟踪;

相似文献

外文文献
中文文献
专利

1. Efficient and retargetable SIMD translation in a dynamic binary translator [J] . Fu Sheng-Yu, Hong Ding-Yong, Liu Yu-Ping, Software . 2018,第6期

机译：动态二进制转换器中的高效且可重定向的SIMD转换
2. DBILL: An Efficient and Retargetable Dynamic Binary Instrumentation Framework using LLVM Backend [J] . Yi-Hong Lyu, Ding-Yong Hong, Tai-Yi Wu, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2014,第7期

机译：DBILL：使用LLVM后端的高效且可重定向的动态二进制工具框架
3. Efficiently Parallelizing Instruction Set Simulation of Embedded Multi-Core Processors Using Region-based Just-in-Time Dynamic Binary Translation [J] . Stephen Kyle, Igor B?hm, Bj?rn Franke, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2012,第5期

机译：使用基于区域的即时动态二进制翻译对嵌入式多核处理器进行高效并行指令集仿真
4. Optimizing Memory Access Performance using Hardware Assisted Virtualization in Retargetable Dynamic Binary Translation [C] . Antoine Faravelon, Olivier Gruber, Frederic Petrot Euromicro Conference on Digital System Design . 2017

机译：使用硬件辅助虚拟化在Retargable动态二进制转换中优化内存访问性能
5. Sakthi: A retargetable dynamic framework for binary instrumentation. [D] . Vasudevan, Amit. 2003

机译：Sakthi：用于二进制工具的可重定目标的动态框架。
6. Hydrodynamic Behavior of the Intrinsically Disordered Potyvirus Protein VPg of the Translation Initiation Factor eIF4E and of their Binary Complex [O] . Jocelyne Walter, Amandine Barra, Bénédicte Doublet, 2019

机译：固有紊乱的杯状病毒蛋白VPg翻译起始因子eIF4E及其二元复合物的流体动力学行为
7. 1Efficient and Retargetable Dynamic Binary Translation on Multicores [O] . Ding-yong Hong, Jan-jan Wu, Pen-chung Yew, 2015

机译：1多核上的高效且可重定向的动态二进制翻译

Efficient and Retargetable Dynamic Binary Translation on Multicores

摘要

著录项

相似文献

相关主题

期刊订阅