Multitask Deep Learning models for real-time deployment in embedded systems
Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
KTH, Robotik, perception och lärande, RPL
2017
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673 |
id |
ndltd-UPSALLA1-oai-DiVA.org-kth-216673 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-kth-2166732021-05-26T05:45:06ZMultitask Deep Learning models for real-time deployment in embedded systemsengDeep Learning-modeller för multitaskproblem, anpassade för inbyggda system i realtidsapplikationerMartí Rabadán, MiquelKTH, Robotik, perception och lärande, RPLUniversitat Politècnica de Catalunya2017computer visiondeep learningmultitask learningobject detectionsemantic segmentationembedded systemsperceptionroboticsautonomous drivingRoboticsRobotteknik och automationMultitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673EES Examensarbete / Master ThesisEES Examensarbete / Master Thesisapplication/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
computer vision deep learning multitask learning object detection semantic segmentation embedded systems perception robotics autonomous driving Robotics Robotteknik och automation |
spellingShingle |
computer vision deep learning multitask learning object detection semantic segmentation embedded systems perception robotics autonomous driving Robotics Robotteknik och automation Martí Rabadán, Miquel Multitask Deep Learning models for real-time deployment in embedded systems |
description |
Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels. |
author |
Martí Rabadán, Miquel |
author_facet |
Martí Rabadán, Miquel |
author_sort |
Martí Rabadán, Miquel |
title |
Multitask Deep Learning models for real-time deployment in embedded systems |
title_short |
Multitask Deep Learning models for real-time deployment in embedded systems |
title_full |
Multitask Deep Learning models for real-time deployment in embedded systems |
title_fullStr |
Multitask Deep Learning models for real-time deployment in embedded systems |
title_full_unstemmed |
Multitask Deep Learning models for real-time deployment in embedded systems |
title_sort |
multitask deep learning models for real-time deployment in embedded systems |
publisher |
KTH, Robotik, perception och lärande, RPL |
publishDate |
2017 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673 |
work_keys_str_mv |
AT martirabadanmiquel multitaskdeeplearningmodelsforrealtimedeploymentinembeddedsystems AT martirabadanmiquel deeplearningmodellerformultitaskproblemanpassadeforinbyggdasystemirealtidsapplikationer |
_version_ |
1719406634558554112 |