ESL Evaluation of Memory Interface Architecture for Many-Core System
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at th...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/52923442022193345617 |
id |
ndltd-TW-104NTHU5392101 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTHU53921012017-08-27T04:30:36Z http://ndltd.ncl.edu.tw/handle/52923442022193345617 ESL Evaluation of Memory Interface Architecture for Many-Core System 電子系統層級多核心平台之記憶體架構評估 Lai, Jyun-Long 賴俊龍 碩士 國立清華大學 資訊工程學系 104 Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at the same rate as microprocessor cycle times. In other words, the rate of improvement in processor speed exceeds the rate of improvement in DRAM memory speed, that W. Wulf and S. McKee called the phenomenon "memory wall". Therefore, in past few decades, people do not blindly upgrades single processor’s performance, but increasing the amount of on-chip cores or using the NoC-based many-core architecture for the throughput and low power consumption. Unfortunately, the demand for memory bandwidth or throughput is still increased. Therefore, many engineers dedicate to improve the efficiency between memory controller and DRAM by proposing better memory scheduling policy, increasing bandwidth and improving the access speed, etc. Recently, the emergence of 3D-stacked DRAM (wide I/O) slightly reduces the speed gap between processor and memory system. But the many-core architecture which use mesh or torus architecture a bridge to connect processors and memory controllers has a characteristic that some DRAM request from processor may go through very far distance to access memory controller. Based on the above motivation, we present an architecture which improves the efficiency of accessing stacked memories and reduce routing time on many-core platform. We use an extra crossbar switch interconnect to transport the DRAM request and groups few numbers of processor to specify DRAM-channel. We call the traditional method as \textbf{Original approach} and call our proposed architecture as \textbf{CS-based approach}. Experimental results of SPLASH2 applications demonstrates speed up that ranges from 1.02 to 1.13 times, with crossbar switch interconnect. Huang, Chih-Tsun 黃稚存 2016 學位論文 ; thesis 55 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at the same rate as microprocessor cycle times. In other words, the rate of improvement in processor speed exceeds the rate of improvement in DRAM memory speed, that W. Wulf and S. McKee called the phenomenon "memory wall". Therefore, in past few decades, people do not blindly upgrades single processor’s performance, but increasing the amount of on-chip cores or using the NoC-based many-core architecture for the throughput and low power consumption. Unfortunately, the demand for memory bandwidth or throughput is still increased. Therefore, many engineers dedicate to improve the efficiency between memory controller and DRAM by proposing better memory scheduling policy, increasing bandwidth and improving the access speed, etc. Recently, the emergence of 3D-stacked DRAM (wide I/O) slightly reduces the speed gap between processor and memory system. But the many-core architecture which use mesh or torus architecture a bridge to connect processors and memory controllers has a characteristic that some DRAM request from processor may go through very far distance to access memory controller. Based on the above motivation, we present an architecture which improves the efficiency of accessing stacked memories and reduce routing time on many-core platform. We use an extra crossbar switch interconnect to transport the DRAM request and groups few numbers of processor to specify DRAM-channel. We call the traditional method as \textbf{Original approach} and call our proposed architecture as \textbf{CS-based approach}. Experimental results of SPLASH2 applications demonstrates speed up that ranges from 1.02 to 1.13 times, with crossbar switch interconnect.
|
author2 |
Huang, Chih-Tsun |
author_facet |
Huang, Chih-Tsun Lai, Jyun-Long 賴俊龍 |
author |
Lai, Jyun-Long 賴俊龍 |
spellingShingle |
Lai, Jyun-Long 賴俊龍 ESL Evaluation of Memory Interface Architecture for Many-Core System |
author_sort |
Lai, Jyun-Long |
title |
ESL Evaluation of Memory Interface Architecture for Many-Core System |
title_short |
ESL Evaluation of Memory Interface Architecture for Many-Core System |
title_full |
ESL Evaluation of Memory Interface Architecture for Many-Core System |
title_fullStr |
ESL Evaluation of Memory Interface Architecture for Many-Core System |
title_full_unstemmed |
ESL Evaluation of Memory Interface Architecture for Many-Core System |
title_sort |
esl evaluation of memory interface architecture for many-core system |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/52923442022193345617 |
work_keys_str_mv |
AT laijyunlong eslevaluationofmemoryinterfacearchitectureformanycoresystem AT làijùnlóng eslevaluationofmemoryinterfacearchitectureformanycoresystem AT laijyunlong diànzixìtǒngcéngjíduōhéxīnpíngtáizhījìyìtǐjiàgòupínggū AT làijùnlóng diànzixìtǒngcéngjíduōhéxīnpíngtáizhījìyìtǐjiàgòupínggū |
_version_ |
1718520154538639360 |