Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores
A cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficie...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8877713/ |
id |
doaj-1ece553dcdfc499c9430882b6ccb0c21 |
---|---|
record_format |
Article |
spelling |
doaj-1ece553dcdfc499c9430882b6ccb0c212021-03-29T23:17:07ZengIEEEIEEE Access2169-35362019-01-01715259915261110.1109/ACCESS.2019.29485778877713Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC CoresMatteo Grimaldi0Valentino Peluso1Andrea Calimera2https://orcid.org/0000-0001-5881-3811Politecnico di Torino, Turin, ItalyPolitecnico di Torino, Turin, ItalyPolitecnico di Torino, Turin, ItalyA cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficient in searching the compression that ensures minimum model size without accuracy loss. However, there exist other optimal configurations that stem from the memory constraint. The objective of this work is to make an assessment of such memory-bounded implementations and to show that most of them are centred on specific parameter settings that are found difficult to be implemented on a low-power RISC. Hence, the focus is on quantifying the distance to optimality of the closest implementations that instead can be actually deployed on hardware. The analysis is powered by a two-stage framework that efficiently explores the memory-accuracy space using a lightweight, hardware-conscious heuristic optimization. Results are collected from three realistic IoT tasks (Image Classification on CIFAR-10, Keyword Spotting on the Speech Commands Dataset, Facial Expression Recognition on Fer2013) run on RISC cores (Cortex-M by ARM) with few hundreds KB of on-chip RAM.https://ieeexplore.ieee.org/document/8877713/Neural networksInternet of Thingsoptimization methodslow power electronics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Matteo Grimaldi Valentino Peluso Andrea Calimera |
spellingShingle |
Matteo Grimaldi Valentino Peluso Andrea Calimera Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores IEEE Access Neural networks Internet of Things optimization methods low power electronics |
author_facet |
Matteo Grimaldi Valentino Peluso Andrea Calimera |
author_sort |
Matteo Grimaldi |
title |
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores |
title_short |
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores |
title_full |
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores |
title_fullStr |
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores |
title_full_unstemmed |
Optimality Assessment of Memory-Bounded ConvNets Deployed on Resource-Constrained RISC Cores |
title_sort |
optimality assessment of memory-bounded convnets deployed on resource-constrained risc cores |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
A cost-effective implementation of Convolutional Neural Nets on the mobile edge of the Internet-of-Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that use a joint combination of filter pruning and weight quantization have proven efficient in searching the compression that ensures minimum model size without accuracy loss. However, there exist other optimal configurations that stem from the memory constraint. The objective of this work is to make an assessment of such memory-bounded implementations and to show that most of them are centred on specific parameter settings that are found difficult to be implemented on a low-power RISC. Hence, the focus is on quantifying the distance to optimality of the closest implementations that instead can be actually deployed on hardware. The analysis is powered by a two-stage framework that efficiently explores the memory-accuracy space using a lightweight, hardware-conscious heuristic optimization. Results are collected from three realistic IoT tasks (Image Classification on CIFAR-10, Keyword Spotting on the Speech Commands Dataset, Facial Expression Recognition on Fer2013) run on RISC cores (Cortex-M by ARM) with few hundreds KB of on-chip RAM. |
topic |
Neural networks Internet of Things optimization methods low power electronics |
url |
https://ieeexplore.ieee.org/document/8877713/ |
work_keys_str_mv |
AT matteogrimaldi optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores AT valentinopeluso optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores AT andreacalimera optimalityassessmentofmemoryboundedconvnetsdeployedonresourceconstrainedrisccores |
_version_ |
1724189768837758976 |