Compiler and Architecture Support for Affine Register File on GPU

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files pl...

Full description

Bibliographic Details
Main Authors: Kan, Li Chen, 甘禮禎
Other Authors: Lee, Jenq Kuen
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/tz9wbf
id ndltd-TW-104NTHU5392138
record_format oai_dc
spelling ndltd-TW-104NTHU53921382019-05-15T23:09:26Z http://ndltd.ncl.edu.tw/handle/tz9wbf Compiler and Architecture Support for Affine Register File on GPU 針對圖形處理器上仿射暫存器的編譯器與架構支援 Kan, Li Chen 甘禮禎 碩士 國立清華大學 資訊工程學系 104 Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU. Lee, Jenq Kuen 李政崑 2016 學位論文 ; thesis 32 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU.
author2 Lee, Jenq Kuen
author_facet Lee, Jenq Kuen
Kan, Li Chen
甘禮禎
author Kan, Li Chen
甘禮禎
spellingShingle Kan, Li Chen
甘禮禎
Compiler and Architecture Support for Affine Register File on GPU
author_sort Kan, Li Chen
title Compiler and Architecture Support for Affine Register File on GPU
title_short Compiler and Architecture Support for Affine Register File on GPU
title_full Compiler and Architecture Support for Affine Register File on GPU
title_fullStr Compiler and Architecture Support for Affine Register File on GPU
title_full_unstemmed Compiler and Architecture Support for Affine Register File on GPU
title_sort compiler and architecture support for affine register file on gpu
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/tz9wbf
work_keys_str_mv AT kanlichen compilerandarchitecturesupportforaffineregisterfileongpu
AT gānlǐzhēn compilerandarchitecturesupportforaffineregisterfileongpu
AT kanlichen zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán
AT gānlǐzhēn zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán
_version_ 1719141521029070848