Research on the Difference between Lyric Recognition and Speech Recognition in Mandarin Songs

碩士 === 國立臺灣大學 === 工程科學及海洋工程學研究所 === 107 === Song retrieval is an indispensable part in modern life. One expects to find the song simply by singing few words or humming a period of the it. Most websites and mobile apps use features of melody in song retrieval tasks nowadays. However, the search metho...

Full description

Bibliographic Details
Main Authors: Chu-An Yu, 游筑安
Other Authors: Chien-Kang Huang
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/w282q5
Description
Summary:碩士 === 國立臺灣大學 === 工程科學及海洋工程學研究所 === 107 === Song retrieval is an indispensable part in modern life. One expects to find the song simply by singing few words or humming a period of the it. Most websites and mobile apps use features of melody in song retrieval tasks nowadays. However, the search method is inconvenient for users who cannot sing in accurate tones. It will not get the correct result for the inaccurate melody. Therefore, if the words in the song can be recognized, it will greatly improve the accuracy of the song search. This research is divided into four parts. The first part compare singing audio and reading audio. The second part is preprocessing the song. It removes the background noise and forces vocal. The third part is extracting features. It converts the audio into a spectrogram as an input image for the training model. The fourth part uses convolutional neural network (CNN) model and connectionist temporal classification (CTC) model to train acoustic model.