NUMA Data-Access Bandwidth Characterization and Modeling

Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties wh...

Full description

Bibliographic Details
Main Author: Braithwaite, Ryan Karl
Other Authors: Computer Science and Applications
Format: Others
Published: Virginia Tech 2014
Subjects:
Online Access:http://hdl.handle.net/10919/31151
http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/
id ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-31151
record_format oai_dc
spelling ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-311512020-09-26T05:38:28Z NUMA Data-Access Bandwidth Characterization and Modeling Braithwaite, Ryan Karl Computer Science and Applications Feng, Wu-Chun Ribbens, Calvin J. McCormick, Patrick Performance Modeling NUMA Benchmarking Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties when accessing non-local data. In many modern NUMA architectures, both memory and I/O controllers are distributed within a node and CPU cores are logically divided into “local” and “remote” data-accesses within the system. In this thesis a method for analyzing main memory and PCIe data-access characteristics of modern AMD and Intel NUMA architectures is presented. Also presented here is the synthesis of data-access performance models designed to quantify the effects of these architectural characteristics on data-access bandwidth. Such performance models provide an analytical tool for determining the performance impact of remote data-accesses for a program or access pattern running in a given system. Data-access performance models also provide a means for comparing the data-access bandwidth and attributes of NUMA architectures, for improving application performance when running on these architectures, and for improving process/thread mapping onto CPU cores in these architectures. Preliminary examples of how programs respond to these data-access bandwidth characteristics are also presented as motivation for future work. Master of Science 2014-03-14T20:31:33Z 2014-03-14T20:31:33Z 2012-01-23 2012-02-06 2012-02-29 2012-02-29 Thesis etd-02062012-121115 http://hdl.handle.net/10919/31151 http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/ Braithwaite_RyanK_T_2012.pdf In Copyright http://rightsstatements.org/vocab/InC/1.0/ application/pdf Virginia Tech
collection NDLTD
format Others
sources NDLTD
topic Performance Modeling
NUMA
Benchmarking
spellingShingle Performance Modeling
NUMA
Benchmarking
Braithwaite, Ryan Karl
NUMA Data-Access Bandwidth Characterization and Modeling
description Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can adversely affect program execution performance by inflicting additional data-access performance penalties when accessing non-local data. In many modern NUMA architectures, both memory and I/O controllers are distributed within a node and CPU cores are logically divided into “local” and “remote” data-accesses within the system. In this thesis a method for analyzing main memory and PCIe data-access characteristics of modern AMD and Intel NUMA architectures is presented. Also presented here is the synthesis of data-access performance models designed to quantify the effects of these architectural characteristics on data-access bandwidth. Such performance models provide an analytical tool for determining the performance impact of remote data-accesses for a program or access pattern running in a given system. Data-access performance models also provide a means for comparing the data-access bandwidth and attributes of NUMA architectures, for improving application performance when running on these architectures, and for improving process/thread mapping onto CPU cores in these architectures. Preliminary examples of how programs respond to these data-access bandwidth characteristics are also presented as motivation for future work. === Master of Science
author2 Computer Science and Applications
author_facet Computer Science and Applications
Braithwaite, Ryan Karl
author Braithwaite, Ryan Karl
author_sort Braithwaite, Ryan Karl
title NUMA Data-Access Bandwidth Characterization and Modeling
title_short NUMA Data-Access Bandwidth Characterization and Modeling
title_full NUMA Data-Access Bandwidth Characterization and Modeling
title_fullStr NUMA Data-Access Bandwidth Characterization and Modeling
title_full_unstemmed NUMA Data-Access Bandwidth Characterization and Modeling
title_sort numa data-access bandwidth characterization and modeling
publisher Virginia Tech
publishDate 2014
url http://hdl.handle.net/10919/31151
http://scholar.lib.vt.edu/theses/available/etd-02062012-121115/
work_keys_str_mv AT braithwaiteryankarl numadataaccessbandwidthcharacterizationandmodeling
_version_ 1719342560947732480