Cachekata : memory hierarchy optimization via dynamic binary translation

As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like...

Full description

Bibliographic Details
Main Author: Taylor, Nathan Bryan
Language:English
Published: University of British Columbia 2013
Online Access:http://hdl.handle.net/2429/44335
id ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-44335
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-443352014-03-26T03:39:31Z Cachekata : memory hierarchy optimization via dynamic binary translation Taylor, Nathan Bryan As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like any other shared hardware resource to effectively support workloads on parallel hardware. We present a binary translation system called Cachekata that provides a byte-granular memory remapping facility within the OS in an efficient manner. Cachekata is incorporated into a larger system, Plastic, which diagnoses and corrects instances of false sharing occurring within running applications. Our implementation is able to achieve a 3-6x performance improvement on known examples of false sharing in parallel benchmarks. 2013-04-19T16:37:38Z 2013-04-20T09:13:29Z 2013 2013-04-19 2013-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/44335 eng University of British Columbia
collection NDLTD
language English
sources NDLTD
description As hardware parallelism continues to increase, CPU caches can no longer be considered a transparent, hardware-level performance optimization. Adverse cache impact on performance is entirely workload-dependent and may depend on runtime factors. The operating system must begin to treat CPU caches like any other shared hardware resource to effectively support workloads on parallel hardware. We present a binary translation system called Cachekata that provides a byte-granular memory remapping facility within the OS in an efficient manner. Cachekata is incorporated into a larger system, Plastic, which diagnoses and corrects instances of false sharing occurring within running applications. Our implementation is able to achieve a 3-6x performance improvement on known examples of false sharing in parallel benchmarks.
author Taylor, Nathan Bryan
spellingShingle Taylor, Nathan Bryan
Cachekata : memory hierarchy optimization via dynamic binary translation
author_facet Taylor, Nathan Bryan
author_sort Taylor, Nathan Bryan
title Cachekata : memory hierarchy optimization via dynamic binary translation
title_short Cachekata : memory hierarchy optimization via dynamic binary translation
title_full Cachekata : memory hierarchy optimization via dynamic binary translation
title_fullStr Cachekata : memory hierarchy optimization via dynamic binary translation
title_full_unstemmed Cachekata : memory hierarchy optimization via dynamic binary translation
title_sort cachekata : memory hierarchy optimization via dynamic binary translation
publisher University of British Columbia
publishDate 2013
url http://hdl.handle.net/2429/44335
work_keys_str_mv AT taylornathanbryan cachekatamemoryhierarchyoptimizationviadynamicbinarytranslation
_version_ 1716656698334117888