ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge
CNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requ...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9547285/ |
id |
doaj-2d3d2edd6f8846438e18a4532d592528 |
---|---|
record_format |
Article |
spelling |
doaj-2d3d2edd6f8846438e18a4532d5925282021-10-04T23:00:34ZengIEEEIEEE Access2169-35362021-01-01913328913330810.1109/ACCESS.2021.31152439547285ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the EdgePaola Busia0https://orcid.org/0000-0002-1434-9858Svetlana Minakova1Todor Stefanov2https://orcid.org/0000-0001-6006-9366Luigi Raffo3https://orcid.org/0000-0001-9683-009XPaolo Meloni4https://orcid.org/0000-0002-8106-4641DIEE, University of Cagliari, Cagliari, ItalyLIACS, Leiden University, Leiden, CA, The NetherlandsLIACS, Leiden University, Leiden, CA, The NetherlandsDIEE, University of Cagliari, Cagliari, ItalyDIEE, University of Cagliari, Cagliari, ItalyCNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requirement for successfully reaching the implementation goals imposed by use-case constraints. Especially when more complex parallel and heterogeneous computing platforms are considered, currently utilized estimation methods are inaccurate or require a lot of characterization experiments and efforts. In this paper, we propose an alternative method, designed to be flexible, easy to use, and accurate at the same time. Considering a modular platform and execution model that adequately describes the details of the platform and the scheduling of different CNN operators on different platform processing elements, our method captures precisely operations and data transfers and their deployment on computing and communication resources, significantly improving the evaluation accuracy. We have tested our method on more than 2000 CNN layers, targeting an FPGA-based accelerator and a GPU platform as reference example architectures. Results have shown that our evaluation method increases the estimation precision by up to <inline-formula> <tex-math notation="LaTeX">$5\times $ </tex-math></inline-formula> for execution time, and by <inline-formula> <tex-math notation="LaTeX">$2\times $ </tex-math></inline-formula> for energy, compared to other widely used analytical methods. Moreover, we assessed the impact of the improved platform-awareness on a set of neural architecture search experiments, targeting both hardware platforms, and enforcing 2 sets of latency constraints, performing 5 trials on each search space, for a total number of 20 experiments. The predictability is improved by <inline-formula> <tex-math notation="LaTeX">$4\times $ </tex-math></inline-formula>, reaching, with respect to alternatives, selection results clearly more similar to those obtained with on-hardware measurements.https://ieeexplore.ieee.org/document/9547285/Convolutional neural networksedge-computingplatform awareness |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Paola Busia Svetlana Minakova Todor Stefanov Luigi Raffo Paolo Meloni |
spellingShingle |
Paola Busia Svetlana Minakova Todor Stefanov Luigi Raffo Paolo Meloni ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge IEEE Access Convolutional neural networks edge-computing platform awareness |
author_facet |
Paola Busia Svetlana Minakova Todor Stefanov Luigi Raffo Paolo Meloni |
author_sort |
Paola Busia |
title |
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge |
title_short |
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge |
title_full |
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge |
title_fullStr |
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge |
title_full_unstemmed |
ALOHA: A Unified Platform-Aware Evaluation Method for CNNs Execution on Heterogeneous Systems at the Edge |
title_sort |
aloha: a unified platform-aware evaluation method for cnns execution on heterogeneous systems at the edge |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
CNN design and deployment on embedded edge-processing systems is an error-prone and effort-hungry process, that poses the need for accurate and effective automated assisting tools. In such tools, pre-evaluating the platform-aware CNN metrics such as latency, energy cost, and throughput is a key requirement for successfully reaching the implementation goals imposed by use-case constraints. Especially when more complex parallel and heterogeneous computing platforms are considered, currently utilized estimation methods are inaccurate or require a lot of characterization experiments and efforts. In this paper, we propose an alternative method, designed to be flexible, easy to use, and accurate at the same time. Considering a modular platform and execution model that adequately describes the details of the platform and the scheduling of different CNN operators on different platform processing elements, our method captures precisely operations and data transfers and their deployment on computing and communication resources, significantly improving the evaluation accuracy. We have tested our method on more than 2000 CNN layers, targeting an FPGA-based accelerator and a GPU platform as reference example architectures. Results have shown that our evaluation method increases the estimation precision by up to <inline-formula> <tex-math notation="LaTeX">$5\times $ </tex-math></inline-formula> for execution time, and by <inline-formula> <tex-math notation="LaTeX">$2\times $ </tex-math></inline-formula> for energy, compared to other widely used analytical methods. Moreover, we assessed the impact of the improved platform-awareness on a set of neural architecture search experiments, targeting both hardware platforms, and enforcing 2 sets of latency constraints, performing 5 trials on each search space, for a total number of 20 experiments. The predictability is improved by <inline-formula> <tex-math notation="LaTeX">$4\times $ </tex-math></inline-formula>, reaching, with respect to alternatives, selection results clearly more similar to those obtained with on-hardware measurements. |
topic |
Convolutional neural networks edge-computing platform awareness |
url |
https://ieeexplore.ieee.org/document/9547285/ |
work_keys_str_mv |
AT paolabusia alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge AT svetlanaminakova alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge AT todorstefanov alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge AT luigiraffo alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge AT paolomeloni alohaaunifiedplatformawareevaluationmethodforcnnsexecutiononheterogeneoussystemsattheedge |
_version_ |
1716843797950758912 |