Synthesis of Speech Animation Based on Viseme Analysis

Master's === National Pingtung University of Education === Department of Computer Science === 98 === This research analyzes how the lips change while a real person speaks and combines that analysis with speech recognition to output lip animation on a real face. First, we took the image of the real person pronunciatio...


Bibliographic Details
Main Authors: Kun-syong Jiang, 江坤雄
Other Authors: Yih-Kai Lin
Format: Others
Language: zh-TW
Published: 2010
Online Access: http://ndltd.ncl.edu.tw/handle/94938321067338443855
id ndltd-TW-098NPTT5394020
record_format oai_dc
spelling ndltd-TW-098NPTT5394020 2016-04-22T04:23:10Z http://ndltd.ncl.edu.tw/handle/94938321067338443855 Synthesis of Speech Animation Based on Viseme Analysis 基於視素分析之語音動畫合成 Kun-syong Jiang 江坤雄 Master's National Pingtung University of Education Department of Computer Science 98 Yih-Kai Lin 林義凱 2010/08/ thesis 36 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Pingtung University of Education === Department of Computer Science === 98 === This research analyzes how the lips change while a real person speaks and combines that analysis with speech recognition to output lip animation on a real face. First, we recorded video of a real person speaking and separated the audio from the images. The audio was used to train Hidden Markov Models (HMMs), producing several models that were stored for later synchronization. For the images, we used a face detection pattern to locate the face. Because the footage was shot from the front, we could locate the lips from the facial proportions and then find the red regions through image processing, obtaining the lip features for each pronunciation. The audio and image results were combined to produce a synchronization model. At run time, the user speaks, and the audio is compared with the established HMMs to find the model with the highest probability. The pronunciation information of that best-matching model is then used to search the established lip database and produce the synchronized output.
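The recognition step described in the abstract (score the input audio against each trained HMM, keep the model with the highest probability, then look up its lip shape) can be sketched as follows. This is a minimal illustration, not the thesis implementation: the discrete observation symbols, the two toy models, and the names `MODELS`, `LIP_DATABASE`, `log_likelihood`, `best_model`, and `lip_shape_for` are all assumptions introduced here.

```python
import math

def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM."""
    n = len(start)
    # Forward variables for the first observation.
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transition matrix, then weight by emission.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_prob

# Hypothetical toy models: (start, transition, emission) over symbols 0, 1, 2.
# Real acoustic features would be continuous; quantised symbols are assumed
# here only to keep the sketch self-contained.
TRANS = [[0.7, 0.3], [0.0, 1.0]]
MODELS = {
    "a": ([1.0, 0.0], TRANS, [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    "u": ([1.0, 0.0], TRANS, [[0.1, 0.1, 0.8], [0.1, 0.8, 0.1]]),
}

# Hypothetical lip database: one stored lip-shape label per pronunciation model.
LIP_DATABASE = {"a": "lips_open_wide", "u": "lips_rounded"}

def best_model(obs, models):
    """Return the name of the model with the highest likelihood for obs."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

def lip_shape_for(obs):
    """Recognise the pronunciation, then look up its lip shape."""
    return LIP_DATABASE[best_model(obs, MODELS)]

if __name__ == "__main__":
    print(lip_shape_for([0, 0, 1, 1]))
    print(lip_shape_for([2, 2, 1, 1]))
```

Comparing log-likelihoods rather than raw probabilities keeps the comparison stable for long observation sequences, where the raw forward probabilities would underflow.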
author2 Yih-Kai Lin
author_facet Yih-Kai Lin
Kun-syong Jiang
江坤雄
author Kun-syong Jiang
江坤雄
spellingShingle Kun-syong Jiang
江坤雄
Synthesis of Speech Animation Based on Viseme Analysis
author_sort Kun-syong Jiang
title Synthesis of Speech Animation Based on Viseme Analysis
title_sort synthesis of speech animation based on viseme analysis
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/94938321067338443855