Compiler and Architecture Support for Affine Register File on GPU
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files pl...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/tz9wbf |
id |
ndltd-TW-104NTHU5392138 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTHU53921382019-05-15T23:09:26Z http://ndltd.ncl.edu.tw/handle/tz9wbf Compiler and Architecture Support for Affine Register File on GPU 針對圖形處理器上仿射暫存器的編譯器與架構支援 Kan, Li Chen 甘禮禎 碩士 國立清華大學 資訊工程學系 104 Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU. Lee, Jenq Kuen 李政崑 2016 學位論文 ; thesis 32 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU.
|
author2 |
Lee, Jenq Kuen |
author_facet |
Lee, Jenq Kuen Kan, Li Chen 甘禮禎 |
author |
Kan, Li Chen 甘禮禎 |
spellingShingle |
Kan, Li Chen 甘禮禎 Compiler and Architecture Support for Affine Register File on GPU |
author_sort |
Kan, Li Chen |
title |
Compiler and Architecture Support for Affine Register File on GPU |
title_short |
Compiler and Architecture Support for Affine Register File on GPU |
title_full |
Compiler and Architecture Support for Affine Register File on GPU |
title_fullStr |
Compiler and Architecture Support for Affine Register File on GPU |
title_full_unstemmed |
Compiler and Architecture Support for Affine Register File on GPU |
title_sort |
compiler and architecture support for affine register file on gpu |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/tz9wbf |
work_keys_str_mv |
AT kanlichen compilerandarchitecturesupportforaffineregisterfileongpu AT gānlǐzhēn compilerandarchitecturesupportforaffineregisterfileongpu AT kanlichen zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán AT gānlǐzhēn zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán |
_version_ |
1719141521029070848 |