Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction

碩士 === 國立交通大學 === 資訊科學與工程研究所 === 108 === Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on...

Full description

Bibliographic Details
Main Authors: Shih, Bo-Wun, 施博文
Other Authors: İk, Tsì-Uí
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/bf62un
id ndltd-TW-108NCTU5394011
record_format oai_dc
spelling ndltd-TW-108NCTU53940112019-11-26T05:16:54Z http://ndltd.ncl.edu.tw/handle/bf62un Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction 基於機器訓練機器技術建構姿態與深度估算的多任務深度學習網路模型 Shih, Bo-Wun 施博文 碩士 國立交通大學 資訊科學與工程研究所 108 Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on the tens of thousands of computations, not to mention running multiple deep learning networks for different purposes simultaneously. Consequently, the automatic training data generation method, machine-train-machine migration learning, is proposed and verified in this work with the demonstration on Kinect. The design of the multi-task deep learning network is introduced between different networks to reduce computation time on the Kinect imitation. The performance of the multi-task deep learning network on skeleton detection and depth prediction, the performance of the software Kinect, are 93.5% in PCKh metric and 94.4% in A2 metric. Notably, the possible positive relationship of skeleton detection and depth prediction from the performance change on multi-task design is also mentioned. Most importantly, the time-complexity of the multi-task deep learning network is verified to indicate the actual detection efficiency improvement from the multi-task design with the processing speed of 52.14 fps. At last, the stereo pose information from our multi-task deep learning model is then utilized for the application on vision-based fitness e-coaching. İk, Tsì-Uí 易志偉 2019 學位論文 ; thesis 52 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊科學與工程研究所 === 108 === Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on the tens of thousands of computations, not to mention running multiple deep learning networks for different purposes simultaneously. Consequently, the automatic training data generation method, machine-train-machine migration learning, is proposed and verified in this work with the demonstration on Kinect. The design of the multi-task deep learning network is introduced between different networks to reduce computation time on the Kinect imitation. The performance of the multi-task deep learning network on skeleton detection and depth prediction, the performance of the software Kinect, are 93.5% in PCKh metric and 94.4% in A2 metric. Notably, the possible positive relationship of skeleton detection and depth prediction from the performance change on multi-task design is also mentioned. Most importantly, the time-complexity of the multi-task deep learning network is verified to indicate the actual detection efficiency improvement from the multi-task design with the processing speed of 52.14 fps. At last, the stereo pose information from our multi-task deep learning model is then utilized for the application on vision-based fitness e-coaching.
author2 İk, Tsì-Uí
author_facet İk, Tsì-Uí
Shih, Bo-Wun
施博文
author Shih, Bo-Wun
施博文
spellingShingle Shih, Bo-Wun
施博文
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
author_sort Shih, Bo-Wun
title Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
title_short Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
title_full Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
title_fullStr Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
title_full_unstemmed Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
title_sort multi-task deep learning networks with machine-train-machine migration learning for pose estimation and depth prediction
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/bf62un
work_keys_str_mv AT shihbowun multitaskdeeplearningnetworkswithmachinetrainmachinemigrationlearningforposeestimationanddepthprediction
AT shībówén multitaskdeeplearningnetworkswithmachinetrainmachinemigrationlearningforposeestimationanddepthprediction
AT shihbowun jīyújīqìxùnliànjīqìjìshùjiàngòuzītàiyǔshēndùgūsuàndeduōrènwùshēndùxuéxíwǎnglùmóxíng
AT shībówén jīyújīqìxùnliànjīqìjìshùjiàngòuzītàiyǔshēndùgūsuàndeduōrènwùshēndùxuéxíwǎnglùmóxíng
_version_ 1719296654623899648