ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge

CNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requ...

Full description

Bibliographic Details
Main Authors: Paola Busia, Svetlana Minakova, Todor Stefanov, Luigi Raffo, Paolo Meloni
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9547285/
id doaj-2d3d2edd6f8846438e18a4532d592528
record_format Article
spelling doaj-2d3d2edd6f8846438e18a4532d5925282021-10-04T23:00:34ZengIEEEIEEE Access2169-35362021-01-01913328913330810.1109/ACCESS.2021.31152439547285ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the EdgePaola Busia0https://orcid.org/0000-0002-1434-9858Svetlana Minakova1Todor Stefanov2https://orcid.org/0000-0001-6006-9366Luigi Raffo3https://orcid.org/0000-0001-9683-009XPaolo Meloni4https://orcid.org/0000-0002-8106-4641DIEE, University of Cagliari, Cagliari, ItalyLIACS, Leiden University, Leiden, CA, The NetherlandsLIACS, Leiden University, Leiden, CA, The NetherlandsDIEE, University of Cagliari, Cagliari, ItalyDIEE, University of Cagliari, Cagliari, ItalyCNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requirement for successfully reaching the implementation goals imposed by use-case constraints. Especially when more complex parallel and heterogeneous computing platforms are considered, currently utilized estimation methods are inaccurate or require a lot of characterization experiments and efforts. In this paper, we propose an alternative method, designed to be flexible, easy to use, and accurate at the same time. Considering a modular platform and execution model that adequately describes the details of the platform and the scheduling of different CNN operators on different platform processing elements, our method captures precisely operations and data transfers and their deployment on computing and communication resources, significantly improving the evaluation accuracy. We have tested our method on more than 2000 CNN layers, targeting an FPGA-based accelerator and a GPU platform as reference example architectures. Results have shown that our evaluation method increases the estimation precision by up to <inline-formula> <tex-math notation="LaTeX">$5\times $ </tex-math></inline-formula> for execution time, and by <inline-formula> <tex-math notation="LaTeX">$2\times $ </tex-math></inline-formula> for energy, compared to other widely used analytical methods. Moreover, we assessed the impact of the improved platform-awareness on a set of neural architecture search experiments, targeting both hardware platforms, and enforcing 2 sets of latency constraints, performing 5 trials on each search space, for a total number of 20 experiments. The predictability is improved by <inline-formula> <tex-math notation="LaTeX">$4\times $ </tex-math></inline-formula>, reaching, with respect to alternatives, selection results clearly more similar to those obtained with on-hardware measurements.https://ieeexplore.ieee.org/document/9547285/Convolutional neural networksedge-computingplatform awareness
collection DOAJ
language English
format Article
sources DOAJ
author Paola Busia
Svetlana Minakova
Todor Stefanov
Luigi Raffo
Paolo Meloni
spellingShingle Paola Busia
Svetlana Minakova
Todor Stefanov
Luigi Raffo
Paolo Meloni
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
IEEE Access
Convolutional neural networks
edge-computing
platform awareness
author_facet Paola Busia
Svetlana Minakova
Todor Stefanov
Luigi Raffo
Paolo Meloni
author_sort Paola Busia
title ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
title_short ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
title_full ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
title_fullStr ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
title_full_unstemmed ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
title_sort aloha: a unified platform-aware evaluation method for cnns execution on heterogeneous systems at the edge
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description CNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requirement for successfully reaching the implementation goals imposed by use-case constraints. Especially when more complex parallel and heterogeneous computing platforms are considered, currently utilized estimation methods are inaccurate or require a lot of characterization experiments and efforts. In this paper, we propose an alternative method, designed to be flexible, easy to use, and accurate at the same time. Considering a modular platform and execution model that adequately describes the details of the platform and the scheduling of different CNN operators on different platform processing elements, our method captures precisely operations and data transfers and their deployment on computing and communication resources, significantly improving the evaluation accuracy. We have tested our method on more than 2000 CNN layers, targeting an FPGA-based accelerator and a GPU platform as reference example architectures. Results have shown that our evaluation method increases the estimation precision by up to <inline-formula> <tex-math notation="LaTeX">$5\times $ </tex-math></inline-formula> for execution time, and by <inline-formula> <tex-math notation="LaTeX">$2\times $ </tex-math></inline-formula> for energy, compared to other widely used analytical methods. Moreover, we assessed the impact of the improved platform-awareness on a set of neural architecture search experiments, targeting both hardware platforms, and enforcing 2 sets of latency constraints, performing 5 trials on each search space, for a total number of 20 experiments. The predictability is improved by <inline-formula> <tex-math notation="LaTeX">$4\times $ </tex-math></inline-formula>, reaching, with respect to alternatives, selection results clearly more similar to those obtained with on-hardware measurements.
topic Convolutional neural networks
edge-computing
platform awareness
url https://ieeexplore.ieee.org/document/9547285/
work_keys_str_mv AT paolabusia alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge
AT svetlanaminakova alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge
AT todorstefanov alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge
AT luigiraffo alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge
AT paolomeloni alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge
_version_ 1716843797950758912