ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures

Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equ...

Full description

Bibliographic Details
Main Authors: Sardar Usman, Rashid Mehmood, Iyad Katib, Aiiad Albeshri
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
MPI
Online Access:https://ieeexplore.ieee.org/document/8737900/
id doaj-3e9287c4cd90492b8a78b52e8aff65e9
record_format Article
spelling doaj-3e9287c4cd90492b8a78b52e8aff65e92021-03-30T00:08:14ZengIEEEIEEE Access2169-35362019-01-017812798129610.1109/ACCESS.2019.29235658737900ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory ArchitecturesSardar Usman0Rashid Mehmood1https://orcid.org/0000-0002-4997-5322Iyad Katib2Aiiad Albeshri3Department of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaHigh Performance Computing Center, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi ArabiaSmart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equation systems, where sparse matrix-vector (SpMV) product is a key computing operation. Several factors determine the performance of parallel SpMV computations, including matrix characteristics, storage formats, and the rising complexity and heterogeneity of computer systems. There is a pressing need for new ways of exploiting parallelism, and mapping data and applications to the computing resources. We propose here ZAKI+, a data-driven machine-learning approach, allowing users to automatically, effortlessly, and speedily obtain the best configuration (the data distribution, the optimal number of processes, and mapping strategy) and performance for the execution of the parallel SpMV computations on distributed memory machines. We train and test the tool using three machine learning methods-decision trees, random forest, and Xtreme boosting-and nearly 2000 real-world matrices obtained from 45 application domains, including computer vision and robotics. ZAKI+ provides optimal process mapping and outperforms the MPI default mapping policy by a factor of 4.24. This is the first work where the sparsity structure of matrices has been exploited to predict the optimal mapping of processes and data in distributed-memory environments by using different base and ensemble machine learning methods. Various CPSs comprise compute-intensive machine learning applications, such as the SpMV, and hence, the process and data mapping contributions of this paper would be of paramount impact for the CPSs.https://ieeexplore.ieee.org/document/8737900/Cyber-physical systemsSpMVsparse linear algebrasparse matricesmachine learningMPI
collection DOAJ
language English
format Article
sources DOAJ
author Sardar Usman
Rashid Mehmood
Iyad Katib
Aiiad Albeshri
spellingShingle Sardar Usman
Rashid Mehmood
Iyad Katib
Aiiad Albeshri
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
IEEE Access
Cyber-physical systems
SpMV
sparse linear algebra
sparse matrices
machine learning
MPI
author_facet Sardar Usman
Rashid Mehmood
Iyad Katib
Aiiad Albeshri
author_sort Sardar Usman
title ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
title_short ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
title_full ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
title_fullStr ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
title_full_unstemmed ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures
title_sort zaki+: a machine learning based process mapping tool for spmv computations on distributed memory architectures
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering, business, and social applications that provide timely intelligence for their design, operations, and management. Many of these scientific and analytics applications require the solution of sparse linear equation systems, where sparse matrix-vector (SpMV) product is a key computing operation. Several factors determine the performance of parallel SpMV computations, including matrix characteristics, storage formats, and the rising complexity and heterogeneity of computer systems. There is a pressing need for new ways of exploiting parallelism, and mapping data and applications to the computing resources. We propose here ZAKI+, a data-driven machine-learning approach, allowing users to automatically, effortlessly, and speedily obtain the best configuration (the data distribution, the optimal number of processes, and mapping strategy) and performance for the execution of the parallel SpMV computations on distributed memory machines. We train and test the tool using three machine learning methods-decision trees, random forest, and Xtreme boosting-and nearly 2000 real-world matrices obtained from 45 application domains, including computer vision and robotics. ZAKI+ provides optimal process mapping and outperforms the MPI default mapping policy by a factor of 4.24. This is the first work where the sparsity structure of matrices has been exploited to predict the optimal mapping of processes and data in distributed-memory environments by using different base and ensemble machine learning methods. Various CPSs comprise compute-intensive machine learning applications, such as the SpMV, and hence, the process and data mapping contributions of this paper would be of paramount impact for the CPSs.
topic Cyber-physical systems
SpMV
sparse linear algebra
sparse matrices
machine learning
MPI
url https://ieeexplore.ieee.org/document/8737900/
work_keys_str_mv AT sardarusman zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures
AT rashidmehmood zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures
AT iyadkatib zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures
AT aiiadalbeshri zakix002bamachinelearningbasedprocessmappingtoolforspmvcomputationsondistributedmemoryarchitectures
_version_ 1724188646000558080