NUMA Data-Access Bandwidth Characterization and Modeling
Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties wh...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Published: |
Virginia Tech
2014
|
Subjects: | |
Online Access: | http://hdl.handle.net/10919/31151 http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/ |
id |
ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-31151 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-311512020-09-26T05:38:28Z NUMA Data-Access Bandwidth Characterization and Modeling Braithwaite, Ryan Karl Computer Science and Applications Feng, Wu-Chun Ribbens, Calvin J. McCormick, Patrick Performance Modeling NUMA Benchmarking Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties when accessing non-local data. In many modern NUMA architectures, both memory and I/O controllers are distributed within a node and CPU cores are logically divided into “local” and “remote” data-accesses within the system. In this thesis a method for analyzing main memory and PCIe data-access characteristics of modern AMD and Intel NUMA architectures is presented. Also presented here is the synthesis of data-access performance models designed to quantify the effects of these architectural characteristics on data-access bandwidth. Such performance models provide an analytical tool for determining the performance impact of remote data-accesses for a program or access pattern running in a given system. Data-access performance models also provide a means for comparing the data-access bandwidth and attributes of NUMA architectures, for improving application performance when running on these architectures, and for improving process/thread mapping onto CPU cores in these architectures. Preliminary examples of how programs respond to these data-access bandwidth characteristics are also presented as motivation for future work. Master of Science 2014-03-14T20:31:33Z 2014-03-14T20:31:33Z 2012-01-23 2012-02-06 2012-02-29 2012-02-29 Thesis etd-02062012-121115 http://hdl.handle.net/10919/31151 http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/ Braithwaite_RyanK_T_2012.pdf In Copyright http://rightsstatements.org/vocab/InC/1.0/ application/pdf Virginia Tech |
collection |
NDLTD |
format |
Others
|
sources |
NDLTD |
topic |
Performance Modeling NUMA Benchmarking |
spellingShingle |
Performance Modeling NUMA Benchmarking Braithwaite, Ryan Karl NUMA Data-Access Bandwidth Characterization and Modeling |
description |
Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties when accessing non-local data. In many modern NUMA architectures, both memory and I/O controllers are distributed within a node and CPU cores are logically divided into “local” and “remote” data-accesses within the system. In this thesis a method for analyzing main memory and PCIe data-access characteristics of modern AMD and Intel NUMA architectures is presented. Also presented here is the synthesis of data-access performance models designed to quantify the effects of these architectural characteristics on data-access bandwidth. Such performance models provide an analytical tool for determining the performance impact of remote data-accesses for a program or access pattern running in a given system. Data-access performance models also provide a means for comparing the data-access bandwidth and attributes of NUMA architectures, for improving application performance when running on these architectures, and for improving process/thread mapping onto CPU cores in these architectures. Preliminary examples of how programs respond to these data-access bandwidth characteristics are also presented as motivation for future work. === Master of Science |
author2 |
Computer Science and Applications |
author_facet |
Computer Science and Applications Braithwaite, Ryan Karl |
author |
Braithwaite, Ryan Karl |
author_sort |
Braithwaite, Ryan Karl |
title |
NUMA Data-Access Bandwidth Characterization and Modeling |
title_short |
NUMA Data-Access Bandwidth Characterization and Modeling |
title_full |
NUMA Data-Access Bandwidth Characterization and Modeling |
title_fullStr |
NUMA Data-Access Bandwidth Characterization and Modeling |
title_full_unstemmed |
NUMA Data-Access Bandwidth Characterization and Modeling |
title_sort |
numa data-access bandwidth characterization and modeling |
publisher |
Virginia Tech |
publishDate |
2014 |
url |
http://hdl.handle.net/10919/31151 http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/ |
work_keys_str_mv |
AT braithwaiteryankarl numadataaccessbandwidthcharacterizationandmodeling |
_version_ |
1719342560947732480 |