Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files
博士 === 國立清華大學 === 資訊工程學系 === 103 === Abstract Digital signal processors (DSPs) with very long instruction word (VLIW) data-path architectures are increasingly being deployed on embedded devices for multimedia processing applications. While developing new VLIW DSP processors, engineers always take c...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/89728810996706561122 |
id |
ndltd-TW-103NTHU5392071 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTHU53920712016-08-15T04:17:33Z http://ndltd.ncl.edu.tw/handle/89728810996706561122 Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files 應用於具有分散式暫存器組的超長指令集數位訊號處理器之全域性編譯器最佳化 Wu, Chung-Ju 吳中如 博士 國立清華大學 資訊工程學系 103 Abstract Digital signal processors (DSPs) with very long instruction word (VLIW) data-path architectures are increasingly being deployed on embedded devices for multimedia processing applications. While developing new VLIW DSP processors, engineers always take complexity, die size, and power dissipation into consideration. Therefore, some popular and traditional designs may not be feasible for embedded systems. Instead, distributed register files and multi-bank register architectures are being adopted to eliminate the amount of read/write ports associated with register files. Although such wide varieties of register file architectures and irregular designs achieve high performance and low power consumption criterion, they present challenges for devising compiler optimization schemes as well. Compiler optimizations, which direct code generation more efficiency, can be conceptually classified into local and global optimizations. Local optimizations only take place within small scope of code fragment, hence the impact of irregular designs is trivial. On the contrary, global optimizations usually go through entire procedure and try to utilize resources as effectively as possible, so the irregular designs and distributed scenarios make global optimizations difficult to have expected improvement. This dissertation has made contributions to the development and discussion of global optimizations on compilers for a novel VLIW DSP with distributed register files. The target DSP architecture, known as PAC DSP core, is designed with distinctively banked register files with highly restricted port access. Our experiences of developing global optimizations in compilers for the PAC DSP may also be of interest to those involved in developing compilers for the similar architectures. Experiments were also performed on the PAC VLIW DSP with distributed register files by incorporating our proposed optimization schemes into an Open64-based compiler. Several benchmarks such as EEMBC and MiBench were tested for evaluating the improvement of utilizing the features of the specific register file architectures. It shows that a VLIW DSP compiler applied by our global optimization schemes exhibits performance superior to traditional strategies. Lee, Jenq Kuen 李政崑 2015 學位論文 ; thesis 62 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立清華大學 === 資訊工程學系 === 103 === Abstract
Digital signal processors (DSPs) with very long instruction word (VLIW) data-path architectures are increasingly being deployed on embedded devices for multimedia processing applications. While developing new VLIW DSP processors, engineers always take complexity, die size, and power dissipation into consideration. Therefore, some popular and traditional designs may not be feasible for embedded systems. Instead, distributed register files and multi-bank register architectures are being adopted to eliminate the amount of read/write ports associated with register files. Although such wide varieties of register file architectures and irregular designs achieve high performance and low power consumption criterion, they present challenges for devising compiler optimization schemes as well.
Compiler optimizations, which direct code generation more efficiency, can be conceptually classified into local and global optimizations. Local optimizations only take place within small scope of code fragment, hence the impact of irregular designs is trivial. On the contrary, global optimizations usually go through entire procedure and try to utilize resources as effectively as possible, so the irregular designs and distributed scenarios make global optimizations difficult to have expected improvement.
This dissertation has made contributions to the development and discussion of global optimizations on compilers for a novel VLIW DSP with distributed register files. The target DSP architecture, known as PAC DSP core, is designed with distinctively banked register files with highly restricted port access. Our experiences of developing global optimizations in compilers for the PAC DSP may also be of interest to those involved in developing compilers for the similar architectures.
Experiments were also performed on the PAC VLIW DSP with distributed register files by incorporating our proposed optimization schemes into an Open64-based compiler. Several benchmarks such as EEMBC and MiBench were tested for evaluating the improvement of utilizing the features of the specific register file architectures. It shows that a VLIW DSP compiler applied by our global optimization schemes exhibits performance superior to traditional strategies.
|
author2 |
Lee, Jenq Kuen |
author_facet |
Lee, Jenq Kuen Wu, Chung-Ju 吳中如 |
author |
Wu, Chung-Ju 吳中如 |
spellingShingle |
Wu, Chung-Ju 吳中如 Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
author_sort |
Wu, Chung-Ju |
title |
Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
title_short |
Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
title_full |
Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
title_fullStr |
Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
title_full_unstemmed |
Global Optimizations in Compilers for VLIW DSP Processors with Distributed Register Files |
title_sort |
global optimizations in compilers for vliw dsp processors with distributed register files |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/89728810996706561122 |
work_keys_str_mv |
AT wuchungju globaloptimizationsincompilersforvliwdspprocessorswithdistributedregisterfiles AT wúzhōngrú globaloptimizationsincompilersforvliwdspprocessorswithdistributedregisterfiles AT wuchungju yīngyòngyújùyǒufēnsànshìzàncúnqìzǔdechāozhǎngzhǐlìngjíshùwèixùnhàochùlǐqìzhīquányùxìngbiānyìqìzuìjiāhuà AT wúzhōngrú yīngyòngyújùyǒufēnsànshìzàncúnqìzǔdechāozhǎngzhǐlìngjíshùwèixùnhàochùlǐqìzhīquányùxìngbiānyìqìzuìjiāhuà |
_version_ |
1718376221512826880 |