Compiler and Architecture Support for Affine Register File on GPU

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files pl...

Full description

Bibliographic Details
Main Authors:	Kan, Li Chen, 甘禮禎
Other Authors:	Lee, Jenq Kuen
Format:	Others
Language:	en_US
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/tz9wbf

id	ndltd-TW-104NTHU5392138
record_format	oai_dc
spelling	ndltd-TW-104NTHU53921382019-05-15T23:09:26Z http://ndltd.ncl.edu.tw/handle/tz9wbf Compiler and Architecture Support for Affine Register File on GPU 針對圖形處理器上仿射暫存器的編譯器與架構支援 Kan, Li Chen 甘禮禎碩士國立清華大學資訊工程學系 104 Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU. Lee, Jenq Kuen 李政崑 2016 學位論文 ; thesis 32 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立清華大學 === 資訊工程學系 === 104 === Graphic Processing Units (GPUs) are designed with a large number of threads to achieve high throughput and performance improvement. For rapidly switching to a different task, each thread has its own registers to store its context. These numerous register files play an important role in performance improvement, but it also takes up a key part of GPU power consumption. Moreover, as shown in previous works, single instruction, multiple threads (SIMT) execution model used in modern GPUs exhibits significant computation redundancy. They execute the same input values and generate the same output values. To eliminate these redundancy computations, scalar/uniform register architecture of GPU is proposed. In this thesis, we proposed the affine register file design which deals with not only the uniform vectors but also the affine vectors. We dispose two registers to store two scalar values, base and stride, of the affine vectors and specific affine ALU to execute the affine computation. Compiler analyzes the value type which is uniform, affine or a general vector and allocates register according to energy saving and performance consideration. In the experiment, it shows that our design can reduce 72.98% and 77.88% energy consumption of register file and ALUs respectively and average 5.25% of total energy consumption of GPU.
author2	Lee, Jenq Kuen
author_facet	Lee, Jenq Kuen Kan, Li Chen 甘禮禎
author	Kan, Li Chen 甘禮禎
spellingShingle	Kan, Li Chen 甘禮禎 Compiler and Architecture Support for Affine Register File on GPU
author_sort	Kan, Li Chen
title	Compiler and Architecture Support for Affine Register File on GPU
title_short	Compiler and Architecture Support for Affine Register File on GPU
title_full	Compiler and Architecture Support for Affine Register File on GPU
title_fullStr	Compiler and Architecture Support for Affine Register File on GPU
title_full_unstemmed	Compiler and Architecture Support for Affine Register File on GPU
title_sort	compiler and architecture support for affine register file on gpu
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/tz9wbf
work_keys_str_mv	AT kanlichen compilerandarchitecturesupportforaffineregisterfileongpu AT gānlǐzhēn compilerandarchitecturesupportforaffineregisterfileongpu AT kanlichen zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán AT gānlǐzhēn zhēnduìtúxíngchùlǐqìshàngfǎngshèzàncúnqìdebiānyìqìyǔjiàgòuzhīyuán
_version_	1719141521029070848

Compiler and Architecture Support for Affine Register File on GPU

Similar Items