Cachekata : memory hierarchy optimization via dynamic binary translation
As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2013
|
Online Access: | http://hdl.handle.net/2429/44335 |
id |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-44335 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-443352014-03-26T03:39:31Z Cachekata : memory hierarchy optimization via dynamic binary translation Taylor, Nathan Bryan As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like any other shared hardware resource to effectively support workloads on parallel hardware. We present a binary translation system called Cachekata that provides a byte-granular memory remapping facility within the OS in an efficient manner. Cachekata is incorporated into a larger system, Plastic, which diagnoses and corrects instances of false sharing occurring within running applications. Our implementation is able to achieve a 3-6x performance improvement on known examples of false sharing in parallel benchmarks. 2013-04-19T16:37:38Z 2013-04-20T09:13:29Z 2013 2013-04-19 2013-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/44335 eng University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like any other shared hardware resource to effectively support workloads on parallel hardware.
We present a binary translation system called Cachekata that provides a byte-granular memory remapping facility within the OS in an efficient manner. Cachekata is incorporated into a larger system, Plastic, which diagnoses and corrects instances of false sharing occurring within running applications. Our implementation is able to achieve a 3-6x performance improvement on known examples of false sharing in parallel benchmarks. |
author |
Taylor, Nathan Bryan |
spellingShingle |
Taylor, Nathan Bryan Cachekata : memory hierarchy optimization via dynamic binary translation |
author_facet |
Taylor, Nathan Bryan |
author_sort |
Taylor, Nathan Bryan |
title |
Cachekata : memory hierarchy optimization via dynamic binary translation |
title_short |
Cachekata : memory hierarchy optimization via dynamic binary translation |
title_full |
Cachekata : memory hierarchy optimization via dynamic binary translation |
title_fullStr |
Cachekata : memory hierarchy optimization via dynamic binary translation |
title_full_unstemmed |
Cachekata : memory hierarchy optimization via dynamic binary translation |
title_sort |
cachekata : memory hierarchy optimization via dynamic binary translation |
publisher |
University of British Columbia |
publishDate |
2013 |
url |
http://hdl.handle.net/2429/44335 |
work_keys_str_mv |
AT taylornathanbryan cachekatamemoryhierarchyoptimizationviadynamicbinarytranslation |
_version_ |
1716656698334117888 |