Source Localization, Speech Enhancement and Recognition of a Singing Robot

碩士 === 國立交通大學 === 機械工程學系 === 98 === Nowadays, there are a variety of functional robots included security robot, military robot, household robot and recreational robot, etc. With social progress and the attention of quality life, entertainment undertakings play an important role recently. In this the...

Full description

Bibliographic Details
Main Authors: Kuei, Chen-Yi, 桂振益
Other Authors: Bai, Ming-Sian
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/89646408082160026854
id ndltd-TW-098NCTU5489126
record_format oai_dc
spelling ndltd-TW-098NCTU54891262015-10-13T18:58:40Z http://ndltd.ncl.edu.tw/handle/89646408082160026854 Source Localization, Speech Enhancement and Recognition of a Singing Robot 具聽音辨位和語音增強及辨識的唱歌機器人 Kuei, Chen-Yi 桂振益 碩士 國立交通大學 機械工程學系 98 Nowadays, there are a variety of functional robots included security robot, military robot, household robot and recreational robot, etc. With social progress and the attention of quality life, entertainment undertakings play an important role recently. In this thesis, we present a nickelodeon robot with simultaneous human-tracking. Therefore, the robot contains source localization and speech recognition techniques. The methods of source localization include object-related transfer function (ORTF) based, cross correlation (CC) and generalized cross correlation (CC) method. We recognize words by employing dynamic time warping (DTW) to do dynamic matching after feature extraction. For the purpose of increasing the purity of the command voice, we adopt speech enhancement which contains array signal processing and phase difference (PD) method. All algorithms we take the best one of each purpose to implement on the LEGO NXT robot controlled by Windows-based NI DAQ system. Bai, Ming-Sian 白明憲 2010 學位論文 ; thesis 59 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 機械工程學系 === 98 === Nowadays, there are a variety of functional robots included security robot, military robot, household robot and recreational robot, etc. With social progress and the attention of quality life, entertainment undertakings play an important role recently. In this thesis, we present a nickelodeon robot with simultaneous human-tracking. Therefore, the robot contains source localization and speech recognition techniques. The methods of source localization include object-related transfer function (ORTF) based, cross correlation (CC) and generalized cross correlation (CC) method. We recognize words by employing dynamic time warping (DTW) to do dynamic matching after feature extraction. For the purpose of increasing the purity of the command voice, we adopt speech enhancement which contains array signal processing and phase difference (PD) method. All algorithms we take the best one of each purpose to implement on the LEGO NXT robot controlled by Windows-based NI DAQ system.
author2 Bai, Ming-Sian
author_facet Bai, Ming-Sian
Kuei, Chen-Yi
桂振益
author Kuei, Chen-Yi
桂振益
spellingShingle Kuei, Chen-Yi
桂振益
Source Localization, Speech Enhancement and Recognition of a Singing Robot
author_sort Kuei, Chen-Yi
title Source Localization, Speech Enhancement and Recognition of a Singing Robot
title_short Source Localization, Speech Enhancement and Recognition of a Singing Robot
title_full Source Localization, Speech Enhancement and Recognition of a Singing Robot
title_fullStr Source Localization, Speech Enhancement and Recognition of a Singing Robot
title_full_unstemmed Source Localization, Speech Enhancement and Recognition of a Singing Robot
title_sort source localization, speech enhancement and recognition of a singing robot
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/89646408082160026854
work_keys_str_mv AT kueichenyi sourcelocalizationspeechenhancementandrecognitionofasingingrobot
AT guìzhènyì sourcelocalizationspeechenhancementandrecognitionofasingingrobot
AT kueichenyi jùtīngyīnbiànwèihéyǔyīnzēngqiángjíbiànshídechànggējīqìrén
AT guìzhènyì jùtīngyīnbiànwèihéyǔyīnzēngqiángjíbiànshídechànggējīqìrén
_version_ 1718038105606324224