Lip-reading Recognition Using Motion Vectors

Bibliographic Details
Main Authors: Sheng-Hsiang Hsu, 許聖祥
Other Authors: Tsang-Long Pao
Format: Others
Language: en_US
Published: 2003
Online Access: http://ndltd.ncl.edu.tw/handle/07341077997140588555
Description
Summary: Master's === Tatung University === Graduate Institute of Computer Science and Engineering === 91 === Because speech recognition takes voice signals as input, its accuracy degrades when the signal is noisy. The input images used for lip-reading recognition, however, are not affected by this noise. In addition, nasal sounds such as /m/, /n/, and /ng/ are difficult to distinguish from the voice signal alone, but the mouth shapes change differently when they are pronounced. In this thesis, we propose a lip-reading recognition system using motion vectors. Our approach uses the motion of the lips as the feature: we extract predetermined points on the lips and use a block-matching approach to find their motion vectors. Lip-reading recognition is then performed according to the motion variations of those selected points. Experimental results show that the proposed method using motion vectors is useful and that the lip-reading recognition is efficient.
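
The block-matching step mentioned in the summary can be illustrated with a minimal sketch. The abstract does not state the matching criterion, block size, or search range, so everything below is an assumption made for illustration: an exhaustive search minimising the sum of absolute differences (SAD), a 4x4 block, a search range of 3 pixels, and the hypothetical names `sad`, `motion_vector`, and `make_frame`. It is not the thesis's implementation.

```python
def sad(prev, curr, py, px, cy, cx, size):
    """Sum of absolute differences between a block in prev and one in curr."""
    total = 0
    for dy in range(size):
        for dx in range(size):
            total += abs(prev[py + dy][px + dx] - curr[cy + dy][cx + dx])
    return total


def motion_vector(prev, curr, top, left, size=4, search=3):
    """Exhaustive block matching (assumed criterion: minimum SAD).

    Takes the size x size block whose top-left corner is (top, left) in
    the previous frame and returns the displacement (dy, dx) of the
    best-matching block in the current frame within +/- search pixels.
    """
    height, width = len(curr), len(curr[0])
    best_cost, best = None, (0, 0)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            cy, cx = top + dy, left + dx
            # Skip candidate blocks that fall outside the frame.
            if cy < 0 or cx < 0 or cy + size > height or cx + size > width:
                continue
            cost = sad(prev, curr, top, left, cy, cx, size)
            if best_cost is None or cost < best_cost:
                best_cost, best = cost, (dy, dx)
    return best


def make_frame(height, width, top, left, size=4):
    """Synthetic grayscale frame: a textured patch on a black background."""
    frame = [[0] * width for _ in range(height)]
    for y in range(size):
        for x in range(size):
            frame[top + y][left + x] = 10 + 10 * y + x
    return frame


# A stand-in "lip point" block at (5, 5) moves 2 rows down and 1 column left.
prev_frame = make_frame(16, 16, 5, 5)
curr_frame = make_frame(16, 16, 7, 4)
mv = motion_vector(prev_frame, curr_frame, 5, 5)
print(mv)
```

In a setup like the one the summary describes, such a function would be called once per predetermined lip point for each pair of consecutive frames, and the resulting sequence of (dy, dx) vectors would serve as the feature for recognition. How the thesis turns those vectors into a recognition decision is not specified in the abstract.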