Dynamic Binary Vectorization in Enhanced HQEMU

碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 107 === Auto vectorization techniques have been adopted by compilers to exploit data-level parallelism in parallel processing for decades. However, since processor architectures have kept enhancing with new features to improve vector/SIMD performance, legacy applica...

Full description

Bibliographic Details
Main Authors:	Chih-Min Lin, 林致民
Other Authors:	徐慰中
Format:	Others
Language:	en_US
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/9mwuvj

id	ndltd-TW-107NTU05641027
record_format	oai_dc
spelling	ndltd-TW-107NTU056410272019-11-16T05:28:00Z http://ndltd.ncl.edu.tw/handle/9mwuvj Dynamic Binary Vectorization in Enhanced HQEMU 基於進階HQEMU之動態二進制碼向量化 Chih-Min Lin 林致民碩士國立臺灣大學資訊網路與多媒體研究所 107 Auto vectorization techniques have been adopted by compilers to exploit data-level parallelism in parallel processing for decades. However, since processor architectures have kept enhancing with new features to improve vector/SIMD performance, legacy application binaries failed to fully exploit new vector/SIMD capabilities in modern architectures. For example, legacy ARMv7 binaries cannot benefit from ARMv8 SIMD double precision capability, and legacy x86 binaries cannot enjoy the power of AVX-512 extensions. In this thesis, we study the fundamental issues involved in cross-ISA Dynamic Binary Translation (DBT) to convert non-vectorized loops to vector/SIMD forms to achieve greater computation throughput available in newer processor architectures. The key idea is to recover critical loop information from those application binaries in order to carry out vectorization at runtime. Experiment results show that our approach achieves an average speedup of 1.42x compared to ARMv7 native run across various benchmarks in an ARMv7-to-ARMv8 dynamic binary translation system. 徐慰中 2019 學位論文 ; thesis 36 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 107 === Auto vectorization techniques have been adopted by compilers to exploit data-level parallelism in parallel processing for decades. However, since processor architectures have kept enhancing with new features to improve vector/SIMD performance, legacy application binaries failed to fully exploit new vector/SIMD capabilities in modern architectures. For example, legacy ARMv7 binaries cannot benefit from ARMv8 SIMD double precision capability, and legacy x86 binaries cannot enjoy the power of AVX-512 extensions. In this thesis, we study the fundamental issues involved in cross-ISA Dynamic Binary Translation (DBT) to convert non-vectorized loops to vector/SIMD forms to achieve greater computation throughput available in newer processor architectures. The key idea is to recover critical loop information from those application binaries in order to carry out vectorization at runtime. Experiment results show that our approach achieves an average speedup of 1.42x compared to ARMv7 native run across various benchmarks in an ARMv7-to-ARMv8 dynamic binary translation system.
author2	徐慰中
author_facet	徐慰中 Chih-Min Lin 林致民
author	Chih-Min Lin 林致民
spellingShingle	Chih-Min Lin 林致民 Dynamic Binary Vectorization in Enhanced HQEMU
author_sort	Chih-Min Lin
title	Dynamic Binary Vectorization in Enhanced HQEMU
title_short	Dynamic Binary Vectorization in Enhanced HQEMU
title_full	Dynamic Binary Vectorization in Enhanced HQEMU
title_fullStr	Dynamic Binary Vectorization in Enhanced HQEMU
title_full_unstemmed	Dynamic Binary Vectorization in Enhanced HQEMU
title_sort	dynamic binary vectorization in enhanced hqemu
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/9mwuvj
work_keys_str_mv	AT chihminlin dynamicbinaryvectorizationinenhancedhqemu AT línzhìmín dynamicbinaryvectorizationinenhancedhqemu AT chihminlin jīyújìnjiēhqemuzhīdòngtàièrjìnzhìmǎxiàngliànghuà AT línzhìmín jīyújìnjiēhqemuzhīdòngtàièrjìnzhìmǎxiàngliànghuà
_version_	1719292828698279936

Dynamic Binary Vectorization in Enhanced HQEMU

Similar Items