Multitask Deep Learning models for real-time deployment in embedded systems

Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way...

Full description

Bibliographic Details
Main Author:	Martí Rabadán, Miquel
Format:	Others
Language:	English
Published:	KTH, Robotik, perception och lärande, RPL 2017
Subjects:	computer vision deep learning multitask learning object detection semantic segmentation embedded systems perception robotics autonomous driving Robotics Robotteknik och automation
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673

id	ndltd-UPSALLA1-oai-DiVA.org-kth-216673
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-kth-2166732021-05-26T05:45:06ZMultitask Deep Learning models for real-time deployment in embedded systemsengDeep Learning-modeller för multitaskproblem, anpassade för inbyggda system i realtidsapplikationerMartí Rabadán, MiquelKTH, Robotik, perception och lärande, RPLUniversitat Politècnica de Catalunya2017computer visiondeep learningmultitask learningobject detectionsemantic segmentationembedded systemsperceptionroboticsautonomous drivingRoboticsRobotteknik och automationMultitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673EES Examensarbete / Master ThesisEES Examensarbete / Master Thesisapplication/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	computer vision deep learning multitask learning object detection semantic segmentation embedded systems perception robotics autonomous driving Robotics Robotteknik och automation
spellingShingle	computer vision deep learning multitask learning object detection semantic segmentation embedded systems perception robotics autonomous driving Robotics Robotteknik och automation Martí Rabadán, Miquel Multitask Deep Learning models for real-time deployment in embedded systems
description	Multitask Learning (MTL) was conceived as an approach to improve thegeneralization ability of machine learning models. When applied to neu-ral networks, multitask models take advantage of sharing resources forreducing the total inference time, memory footprint and model size. Wepropose MTL as a way to speed up deep learning models for applicationsin which multiple tasks need to be solved simultaneously, which is par-ticularly useful in embedded, real-time systems such as the ones foundin autonomous cars or UAVs.In order to study this approach, we apply MTL to a Computer Vi-sion problem in which both Object Detection and Semantic Segmenta-tion tasks are solved based on the Single Shot Multibox Detector andFully Convolutional Networks with skip connections respectively, usinga ResNet-50 as the base network. We train multitask models for twodifferent datasets, Pascal VOC, which is used to validate the decisionsmade, and a combination of datasets with aerial view images capturedfrom UAVs.Finally, we analyse the challenges that appear during the process of train-ing multitask networks and try to overcome them. However, these hinderthe capacity of our multitask models to reach the performance of the bestsingle-task models trained without the limitations imposed by applyingMTL. Nevertheless, multitask networks benefit from sharing resourcesand are 1.6x faster, lighter and use less memory compared to deployingthe single-task models in parallel, which turns essential when runningthem on a Jetson TX1 SoC as the parallel approach does not fit intomemory. We conclude that MTL has the potential to give superior per-formance as far as the object detection and semantic segmentation tasksare concerned in exchange of a more complex training process that re-quires overcoming challenges not present in the training of single-taskmodels.
author	Martí Rabadán, Miquel
author_facet	Martí Rabadán, Miquel
author_sort	Martí Rabadán, Miquel
title	Multitask Deep Learning models for real-time deployment in embedded systems
title_short	Multitask Deep Learning models for real-time deployment in embedded systems
title_full	Multitask Deep Learning models for real-time deployment in embedded systems
title_fullStr	Multitask Deep Learning models for real-time deployment in embedded systems
title_full_unstemmed	Multitask Deep Learning models for real-time deployment in embedded systems
title_sort	multitask deep learning models for real-time deployment in embedded systems
publisher	KTH, Robotik, perception och lärande, RPL
publishDate	2017
url	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673
work_keys_str_mv	AT martirabadanmiquel multitaskdeeplearningmodelsforrealtimedeploymentinembeddedsystems AT martirabadanmiquel deeplearningmodellerformultitaskproblemanpassadeforinbyggdasystemirealtidsapplikationer
_version_	1719406634558554112

Multitask Deep Learning models for real-time deployment in embedded systems

Similar Items