Understanding object-level memory access patterns across the spectrum

Memory accesses limit the performance and scalability of countless applications. Many design and optimization efforts will benefit from an in-depth understanding of memory access behavior, which is not offered by extant access tracing and profiling methods. In this paper, we adopt a holistic memory...

Full description

Bibliographic Details
Main Authors: El-Sayed, Nosayba (Author), Sanchez, Daniel (Author)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor)
Format: Article
Language:English
Published: Association for Computing Machinery (ACM), 2021-04-13T12:36:31Z.
Subjects:
Online Access:Get fulltext
LEADER 02217 am a22002293u 4500
001 130463
042 |a dc 
100 1 0 |a El-Sayed, Nosayba  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
700 1 0 |a Sanchez, Daniel  |e author 
245 0 0 |a Understanding object-level memory access patterns across the spectrum 
260 |b Association for Computing Machinery (ACM),   |c 2021-04-13T12:36:31Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/130463 
520 |a Memory accesses limit the performance and scalability of countless applications. Many design and optimization efforts will benefit from an in-depth understanding of memory access behavior, which is not offered by extant access tracing and profiling methods. In this paper, we adopt a holistic memory access profiling approach to enable a better understanding of program-system memory interactions. We have developed a two-pass tool adopting fast online and slow offline profiling, with which we have profiled, at the variable/object level, a collection of 38 representative applications spanning major domains (HPC, personal computing, data analytics, AI, graph processing, and datacenter workloads), at varying problem sizes. We have performed detailed result analysis and code examination. Our findings provide new insights into application memory behavior, including insights on per-object access patterns, adoption of data structures, and memory-access changes at different problem sizes. We find that scientific computation applications exhibit distinct behaviors compared to datacenter workloads, motivating separate memory system design/optimizations. 
520 |a National Key Basic Research Program of China (Grant 2016YFA0602100) 
520 |a National Science Foundation (China) (Grant No. 91530323) 
520 |a National Research Foundation of Korea (Grant 2015R1C1A1A0152105) 
520 |a United States. Department of Energy (Contract DE-AC05-00OR22725) 
546 |a en 
655 7 |a Article 
773 |t 10.1145/3126908.3126917 
773 |t Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017