Multitask Deep Learning models for real-time deployment in embedded systems

Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way...

Full description

Bibliographic Details
Main Author: Martí Rabadán, Miquel
Format: Others
Language:English
Published: KTH, Robotik, perception och lärande, RPL 2017
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673
id ndltd-UPSALLA1-oai-DiVA.org-kth-216673
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-2166732021-05-26T05:45:06ZMultitask Deep Learning models for real-time deployment in embedded systemsengDeep Learning-modeller för multitaskproblem, anpassade för inbyggda system i realtidsapplikationerMartí Rabadán, MiquelKTH, Robotik, perception och lärande, RPLUniversitat Politècnica de Catalunya2017computer visiondeep learningmultitask learningobject detectionsemantic segmentationembedded systemsperceptionroboticsautonomous drivingRoboticsRobotteknik och automationMultitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673EES Examensarbete / Master ThesisEES Examensarbete / Master Thesisapplication/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic computer vision
deep learning
multitask learning
object detection
semantic segmentation
embedded systems
perception
robotics
autonomous driving
Robotics
Robotteknik och automation
spellingShingle computer vision
deep learning
multitask learning
object detection
semantic segmentation
embedded systems
perception
robotics
autonomous driving
Robotics
Robotteknik och automation
Martí Rabadán, Miquel
Multitask Deep Learning models for real-time deployment in embedded systems
description Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels.
author Martí Rabadán, Miquel
author_facet Martí Rabadán, Miquel
author_sort Martí Rabadán, Miquel
title Multitask Deep Learning models for real-time deployment in embedded systems
title_short Multitask Deep Learning models for real-time deployment in embedded systems
title_full Multitask Deep Learning models for real-time deployment in embedded systems
title_fullStr Multitask Deep Learning models for real-time deployment in embedded systems
title_full_unstemmed Multitask Deep Learning models for real-time deployment in embedded systems
title_sort multitask deep learning models for real-time deployment in embedded systems
publisher KTH, Robotik, perception och lärande, RPL
publishDate 2017
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673
work_keys_str_mv AT martirabadanmiquel multitaskdeeplearningmodelsforrealtimedeploymentinembeddedsystems
AT martirabadanmiquel deeplearningmodellerformultitaskproblemanpassadeforinbyggdasystemirealtidsapplikationer
_version_ 1719406634558554112