Deep Learning Techniques for RGB-D Visual Recognition System

碩士 === 國立臺灣科技大學 === 資訊工程系 === 104 === Data fusion from different modalities has been extensively studied for a better understanding of multimedia contents. On one hand, the emergence of new devices and decreasing storage costs cause growing amounts of data being collected. Though bigger data makes i...

Full description

Bibliographic Details
Main Authors:	Yuan-Sheng Hsiao, 蕭元昇
Other Authors:	Kai-Lung Hua
Format:	Others
Language:	zh-TW
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/10340938691976874559

id	ndltd-TW-104NTUS5392061
record_format	oai_dc
spelling	ndltd-TW-104NTUS53920612017-09-24T04:40:50Z http://ndltd.ncl.edu.tw/handle/10340938691976874559 Deep Learning Techniques for RGB-D Visual Recognition System 基於深度學習之 RGB-D 視覺辨識系統 Yuan-Sheng Hsiao 蕭元昇碩士國立臺灣科技大學資訊工程系 104 Data fusion from different modalities has been extensively studied for a better understanding of multimedia contents. On one hand, the emergence of new devices and decreasing storage costs cause growing amounts of data being collected. Though bigger data makes it easier to mine information, methods for big data analytics are not well investigated. On the other hand, new machine learning techniques, such as deep learning, have been shown to be one of the the key elements in achieving state-of-the-art inference performances in a variety of applications. Therefore, some of the old questions in data fusion are in need to be addressed again for these new changes. These questions are: What is the most effective way to combine data for various modalities? Does the fusion method affect the performance with different classifiers? To answer these questions, in this paper, we present a comparative study for evaluating early and late fusion schemes with several types of SVM and deep learning classifiers on two challenging RGB-D based visual recognition tasks: hand gesture recognition and generic object recognition. The findings from this study provide useful policy and practical guidance for the development of visual recognition systems. Kai-Lung Hua 花凱龍 2016 學位論文 ; thesis 50 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺灣科技大學 === 資訊工程系 === 104 === Data fusion from different modalities has been extensively studied for a better understanding of multimedia contents. On one hand, the emergence of new devices and decreasing storage costs cause growing amounts of data being collected. Though bigger data makes it easier to mine information, methods for big data analytics are not well investigated. On the other hand, new machine learning techniques, such as deep learning, have been shown to be one of the the key elements in achieving state-of-the-art inference performances in a variety of applications. Therefore, some of the old questions in data fusion are in need to be addressed again for these new changes. These questions are: What is the most effective way to combine data for various modalities? Does the fusion method affect the performance with different classifiers? To answer these questions, in this paper, we present a comparative study for evaluating early and late fusion schemes with several types of SVM and deep learning classifiers on two challenging RGB-D based visual recognition tasks: hand gesture recognition and generic object recognition. The findings from this study provide useful policy and practical guidance for the development of visual recognition systems.
author2	Kai-Lung Hua
author_facet	Kai-Lung Hua Yuan-Sheng Hsiao 蕭元昇
author	Yuan-Sheng Hsiao 蕭元昇
spellingShingle	Yuan-Sheng Hsiao 蕭元昇 Deep Learning Techniques for RGB-D Visual Recognition System
author_sort	Yuan-Sheng Hsiao
title	Deep Learning Techniques for RGB-D Visual Recognition System
title_short	Deep Learning Techniques for RGB-D Visual Recognition System
title_full	Deep Learning Techniques for RGB-D Visual Recognition System
title_fullStr	Deep Learning Techniques for RGB-D Visual Recognition System
title_full_unstemmed	Deep Learning Techniques for RGB-D Visual Recognition System
title_sort	deep learning techniques for rgb-d visual recognition system
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/10340938691976874559
work_keys_str_mv	AT yuanshenghsiao deeplearningtechniquesforrgbdvisualrecognitionsystem AT xiāoyuánshēng deeplearningtechniquesforrgbdvisualrecognitionsystem AT yuanshenghsiao jīyúshēndùxuéxízhīrgbdshìjuébiànshíxìtǒng AT xiāoyuánshēng jīyúshēndùxuéxízhīrgbdshìjuébiànshíxìtǒng
_version_	1718540244976926720

Deep Learning Techniques for RGB-D Visual Recognition System

Similar Items