Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction
碩士 === 國立交通大學 === 資訊科學與工程研究所 === 108 === Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/bf62un |
id |
ndltd-TW-108NCTU5394011 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-108NCTU53940112019-11-26T05:16:54Z http://ndltd.ncl.edu.tw/handle/bf62un Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction 基於機器訓練機器技術建構姿態與深度估算的多任務深度學習網路模型 Shih, Bo-Wun 施博文 碩士 國立交通大學 資訊科學與工程研究所 108 Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on the tens of thousands of computations, not to mention running multiple deep learning networks for different purposes simultaneously. Consequently, the automatic training data generation method, machine-train-machine migration learning, is proposed and verified in this work with the demonstration on Kinect. The design of the multi-task deep learning network is introduced between different networks to reduce computation time on the Kinect imitation. The performance of the multi-task deep learning network on skeleton detection and depth prediction, the performance of the software Kinect, are 93.5% in PCKh metric and 94.4% in A2 metric. Notably, the possible positive relationship of skeleton detection and depth prediction from the performance change on multi-task design is also mentioned. Most importantly, the time-complexity of the multi-task deep learning network is verified to indicate the actual detection efficiency improvement from the multi-task design with the processing speed of 52.14 fps. At last, the stereo pose information from our multi-task deep learning model is then utilized for the application on vision-based fitness e-coaching. İk, Tsì-Uí 易志偉 2019 學位論文 ; thesis 52 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 資訊科學與工程研究所 === 108 === Deep learning faces two barriers, having needs on large numbers of training data and a huge amount of numerical operations respectively. Distressingly, the process of labeling training data is strenuous. Complex deep learning networks typically spend time on the tens of thousands of computations, not to mention running multiple deep learning networks for different purposes simultaneously. Consequently, the automatic training data generation method, machine-train-machine migration learning, is proposed and verified in this work with the demonstration on Kinect. The design of the multi-task deep learning network is introduced between different networks to reduce computation time on the Kinect imitation. The performance of the multi-task deep learning network on skeleton detection and depth prediction, the performance of the software Kinect, are 93.5% in PCKh metric and 94.4% in A2 metric. Notably, the possible positive relationship of skeleton detection and depth prediction from the performance change on multi-task design is also mentioned. Most importantly, the time-complexity of the multi-task deep learning network is verified to indicate the actual detection efficiency improvement from the multi-task design with the processing speed of 52.14 fps. At last, the stereo pose information from our multi-task deep learning model is then utilized for the application on vision-based fitness e-coaching.
|
author2 |
İk, Tsì-Uí |
author_facet |
İk, Tsì-Uí Shih, Bo-Wun 施博文 |
author |
Shih, Bo-Wun 施博文 |
spellingShingle |
Shih, Bo-Wun 施博文 Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
author_sort |
Shih, Bo-Wun |
title |
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
title_short |
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
title_full |
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
title_fullStr |
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
title_full_unstemmed |
Multi-task Deep Learning Networks with Machine-Train-Machine Migration Learning for Pose Estimation and Depth Prediction |
title_sort |
multi-task deep learning networks with machine-train-machine migration learning for pose estimation and depth prediction |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/bf62un |
work_keys_str_mv |
AT shihbowun multitaskdeeplearningnetworkswithmachinetrainmachinemigrationlearningforposeestimationanddepthprediction AT shībówén multitaskdeeplearningnetworkswithmachinetrainmachinemigrationlearningforposeestimationanddepthprediction AT shihbowun jīyújīqìxùnliànjīqìjìshùjiàngòuzītàiyǔshēndùgūsuàndeduōrènwùshēndùxuéxíwǎnglùmóxíng AT shībówén jīyújīqìxùnliànjīqìjìshùjiàngòuzītàiyǔshēndùgūsuàndeduōrènwùshēndùxuéxíwǎnglùmóxíng |
_version_ |
1719296654623899648 |