Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores

A cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficie...

Full description

Bibliographic Details
Main Authors: Matteo Grimaldi, Valentino Peluso, Andrea Calimera
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8877713/
id doaj-1ece553dcdfc499c9430882b6ccb0c21
record_format Article
spelling doaj-1ece553dcdfc499c9430882b6ccb0c212021-03-29T23:17:07ZengIEEEIEEE Access2169-35362019-01-01715259915261110.1109/ACCESS.2019.29485778877713Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC CoresMatteo Grimaldi0Valentino Peluso1Andrea Calimera2https://orcid.org/0000-0001-5881-3811Politecnico di Torino, Turin, ItalyPolitecnico di Torino, Turin, ItalyPolitecnico di Torino, Turin, ItalyA cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficient in searching the compression that ensures minimum model size without accuracy loss. However, there exist other optimal configurations that stem from the memory constraint. The objective of this work is to make an assessment of such memory-bounded implementations and to show that most of them are centred on specific parameter settings that are found difficult to be implemented on a low-power RISC. Hence, the focus is on quantifying the distance to optimality of the closest implementations that instead can be actually deployed on hardware. The analysis is powered by a two-stage framework that efficiently explores the memory-accuracy space using a lightweight, hardware-conscious heuristic optimization. Results are collected from three realistic IoT tasks (Image Classification on CIFAR-10, Keyword Spotting on the Speech Commands Dataset, Facial Expression Recognition on Fer2013) run on RISC cores (Cortex-M by ARM) with few hundreds KB of on-chip RAM.https://ieeexplore.ieee.org/document/8877713/Neural networksInternet of Thingsoptimization methodslow power electronics
collection DOAJ
language English
format Article
sources DOAJ
author Matteo Grimaldi
Valentino Peluso
Andrea Calimera
spellingShingle Matteo Grimaldi
Valentino Peluso
Andrea Calimera
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
IEEE Access
Neural networks
Internet of Things
optimization methods
low power electronics
author_facet Matteo Grimaldi
Valentino Peluso
Andrea Calimera
author_sort Matteo Grimaldi
title Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
title_short Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
title_full Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
title_fullStr Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
title_full_unstemmed Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
title_sort optimality assessment of memory-bounded convnets deployed on resource-constrained risc cores
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description A cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficient in searching the compression that ensures minimum model size without accuracy loss. However, there exist other optimal configurations that stem from the memory constraint. The objective of this work is to make an assessment of such memory-bounded implementations and to show that most of them are centred on specific parameter settings that are found difficult to be implemented on a low-power RISC. Hence, the focus is on quantifying the distance to optimality of the closest implementations that instead can be actually deployed on hardware. The analysis is powered by a two-stage framework that efficiently explores the memory-accuracy space using a lightweight, hardware-conscious heuristic optimization. Results are collected from three realistic IoT tasks (Image Classification on CIFAR-10, Keyword Spotting on the Speech Commands Dataset, Facial Expression Recognition on Fer2013) run on RISC cores (Cortex-M by ARM) with few hundreds KB of on-chip RAM.
topic Neural networks
Internet of Things
optimization methods
low power electronics
url https://ieeexplore.ieee.org/document/8877713/
work_keys_str_mv AT matteogrimaldi optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores
AT valentinopeluso optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores
AT andreacalimera optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores
_version_ 1724189768837758976