ESL Evaluation of Memory Interface Architecture for Many-Core System

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at th...

Full description

Bibliographic Details
Main Authors: Lai, Jyun-Long, 賴俊龍
Other Authors: Huang, Chih-Tsun
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/52923442022193345617
id ndltd-TW-104NTHU5392101
record_format oai_dc
spelling ndltd-TW-104NTHU53921012017-08-27T04:30:36Z http://ndltd.ncl.edu.tw/handle/52923442022193345617 ESL Evaluation of Memory Interface Architecture for Many-Core System 電子系統層級多核心平台之記憶體架構評估 Lai, Jyun-Long 賴俊龍 碩士 國立清華大學 資訊工程學系 104 Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at the same rate as microprocessor cycle times. In other words, the rate of improvement in processor speed exceeds the rate of improvement in DRAM memory speed, that W. Wulf and S. McKee called the phenomenon "memory wall". Therefore, in past few decades, people do not blindly upgrades single processor’s performance, but increasing the amount of on-chip cores or using the NoC-based many-core architecture for the throughput and low power consumption. Unfortunately, the demand for memory bandwidth or throughput is still increased. Therefore, many engineers dedicate to improve the efficiency between memory controller and DRAM by proposing better memory scheduling policy, increasing bandwidth and improving the access speed, etc. Recently, the emergence of 3D-stacked DRAM (wide I/O) slightly reduces the speed gap between processor and memory system. But the many-core architecture which use mesh or torus architecture a bridge to connect processors and memory controllers has a characteristic that some DRAM request from processor may go through very far distance to access memory controller. Based on the above motivation, we present an architecture which improves the efficiency of accessing stacked memories and reduce routing time on many-core platform. We use an extra crossbar switch interconnect to transport the DRAM request and groups few numbers of processor to specify DRAM-channel. We call the traditional method as \textbf{Original approach} and call our proposed architecture as \textbf{CS-based approach}. Experimental results of SPLASH2 applications demonstrates speed up that ranges from 1.02 to 1.13 times, with crossbar switch interconnect. Huang, Chih-Tsun 黃稚存 2016 學位論文 ; thesis 55 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 104 === Because the advantage of DRAM is its structural simplicity: high densities and more inexpensive than other type of RAM, it is very suited to be a role of main-memory in computer architecture. However, for many years, DRAM access latencies have not decreased at the same rate as microprocessor cycle times. In other words, the rate of improvement in processor speed exceeds the rate of improvement in DRAM memory speed, that W. Wulf and S. McKee called the phenomenon "memory wall". Therefore, in past few decades, people do not blindly upgrades single processor’s performance, but increasing the amount of on-chip cores or using the NoC-based many-core architecture for the throughput and low power consumption. Unfortunately, the demand for memory bandwidth or throughput is still increased. Therefore, many engineers dedicate to improve the efficiency between memory controller and DRAM by proposing better memory scheduling policy, increasing bandwidth and improving the access speed, etc. Recently, the emergence of 3D-stacked DRAM (wide I/O) slightly reduces the speed gap between processor and memory system. But the many-core architecture which use mesh or torus architecture a bridge to connect processors and memory controllers has a characteristic that some DRAM request from processor may go through very far distance to access memory controller. Based on the above motivation, we present an architecture which improves the efficiency of accessing stacked memories and reduce routing time on many-core platform. We use an extra crossbar switch interconnect to transport the DRAM request and groups few numbers of processor to specify DRAM-channel. We call the traditional method as \textbf{Original approach} and call our proposed architecture as \textbf{CS-based approach}. Experimental results of SPLASH2 applications demonstrates speed up that ranges from 1.02 to 1.13 times, with crossbar switch interconnect.
author2 Huang, Chih-Tsun
author_facet Huang, Chih-Tsun
Lai, Jyun-Long
賴俊龍
author Lai, Jyun-Long
賴俊龍
spellingShingle Lai, Jyun-Long
賴俊龍
ESL Evaluation of Memory Interface Architecture for Many-Core System
author_sort Lai, Jyun-Long
title ESL Evaluation of Memory Interface Architecture for Many-Core System
title_short ESL Evaluation of Memory Interface Architecture for Many-Core System
title_full ESL Evaluation of Memory Interface Architecture for Many-Core System
title_fullStr ESL Evaluation of Memory Interface Architecture for Many-Core System
title_full_unstemmed ESL Evaluation of Memory Interface Architecture for Many-Core System
title_sort esl evaluation of memory interface architecture for many-core system
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/52923442022193345617
work_keys_str_mv AT laijyunlong eslevaluationofmemoryinterfacearchitectureformanycoresystem
AT làijùnlóng eslevaluationofmemoryinterfacearchitectureformanycoresystem
AT laijyunlong diànzixìtǒngcéngjíduōhéxīnpíngtáizhījìyìtǐjiàgòupínggū
AT làijùnlóng diànzixìtǒngcéngjíduōhéxīnpíngtáizhījìyìtǐjiàgòupínggū
_version_ 1718520154538639360