ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equ...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8737900/ |
id |
doaj-3e9287c4cd90492b8a78b52e8aff65e9 |
---|---|
record_format |
Article |
spelling |
doaj-3e9287c4cd90492b8a78b52e8aff65e92021-03-30T00:08:14ZengIEEEIEEE Access2169-35362019-01-017812798129610.1109/ACCESS.2019.29235658737900ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory ArchitecturesSardar Usman0Rashid Mehmood1https://orcid.org/0000-0002-4997-5322Iyad Katib2Aiiad Albeshri3Department of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaHigh Performance Computing Center, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaSmart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equation systems, where sparse matrix-vector (SpMV) product is a key computing operation. Several factors determine the performance of parallel SpMV computations, including matrix characteristics, storage formats, and the rising complexity and heterogeneity of computer systems. There is a pressing need for new ways of exploiting parallelism, and mapping data and applications to the computing resources. We propose here ZAKI+, a data-driven machine-learning approach, allowing users to automatically, effortlessly, and speedily obtain the best configuration (the data distribution, the optimal number of processes, and mapping strategy) and performance for the execution of the parallel SpMV computations on distributed memory machines. We train and test the tool using three machine learning methods-decision trees, random forest, and Xtreme boosting-and nearly 2000 real-world matrices obtained from 45 application domains, including computer vision and robotics. ZAKI+ provides optimal process mapping and outperforms the MPI default mapping policy by a factor of 4.24. This is the first work where the sparsity structure of matrices has been exploited to predict the optimal mapping of processes and data in distributed-memory environments by using different base and ensemble machine learning methods. Various CPSs comprise compute-intensive machine learning applications, such as the SpMV, and hence, the process and data mapping contributions of this paper would be of paramount impact for the CPSs.https://ieeexplore.ieee.org/document/8737900/Cyber-physical systemsSpMVsparse linear algebrasparse matricesmachine learningMPI |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sardar Usman Rashid Mehmood Iyad Katib Aiiad Albeshri |
spellingShingle |
Sardar Usman Rashid Mehmood Iyad Katib Aiiad Albeshri ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures IEEE Access Cyber-physical systems SpMV sparse linear algebra sparse matrices machine learning MPI |
author_facet |
Sardar Usman Rashid Mehmood Iyad Katib Aiiad Albeshri |
author_sort |
Sardar Usman |
title |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures |
title_short |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures |
title_full |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures |
title_fullStr |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures |
title_full_unstemmed |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures |
title_sort |
zaki+: a machine learning based process mapping tool for spmv computations on distributed memory architectures |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equation systems, where sparse matrix-vector (SpMV) product is a key computing operation. Several factors determine the performance of parallel SpMV computations, including matrix characteristics, storage formats, and the rising complexity and heterogeneity of computer systems. There is a pressing need for new ways of exploiting parallelism, and mapping data and applications to the computing resources. We propose here ZAKI+, a data-driven machine-learning approach, allowing users to automatically, effortlessly, and speedily obtain the best configuration (the data distribution, the optimal number of processes, and mapping strategy) and performance for the execution of the parallel SpMV computations on distributed memory machines. We train and test the tool using three machine learning methods-decision trees, random forest, and Xtreme boosting-and nearly 2000 real-world matrices obtained from 45 application domains, including computer vision and robotics. ZAKI+ provides optimal process mapping and outperforms the MPI default mapping policy by a factor of 4.24. This is the first work where the sparsity structure of matrices has been exploited to predict the optimal mapping of processes and data in distributed-memory environments by using different base and ensemble machine learning methods. Various CPSs comprise compute-intensive machine learning applications, such as the SpMV, and hence, the process and data mapping contributions of this paper would be of paramount impact for the CPSs. |
topic |
Cyber-physical systems SpMV sparse linear algebra sparse matrices machine learning MPI |
url |
https://ieeexplore.ieee.org/document/8737900/ |
work_keys_str_mv |
AT sardarusman zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures AT rashidmehmood zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures AT iyadkatib zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures AT aiiadalbeshri zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures |
_version_ |
1724188646000558080 |