A Python Implementation of Automatic Speech-text Synchronization Using Speech Recognition and Text-to-Speech Technology

碩士 === 長庚大學 === 資訊工程學系 === 103 === With the advent of the global village, "language learning" has become an important issue. Now, the variety of language ability is an indicator of competitiveness. Especially the listening and speaking ability are considered more important. In this study,...

Full description

Bibliographic Details
Main Authors: Chun Han Lai, 賴俊翰
Other Authors: R. Y. Lyu
Format: Others
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/53806441331969263004
Description
Summary:碩士 === 長庚大學 === 資訊工程學系 === 103 === With the advent of the global village, "language learning" has become an important issue. Now, the variety of language ability is an indicator of competitiveness. Especially the listening and speaking ability are considered more important. In this study, we establish a method to create speech and text synchronized audiobooks with “speech recognition” and “cloud text-to-speech” technology. The user can prepare his own arbitrary articles to create the learning materials for "Shadowing technique" with this method. Besides, the materials are made by "word-level" speech and text synchronized audiobooks. These audiobooks are created by "timed-text" files, and the files are produced from the user's articles and corresponding speech files. By synchronization for speech and text technology, named "CGUAlign", user can easily make the "Timed-text" files. CGUAlign, uses Python to wrap the well-known speech recognition technology─HTK(Hidden Markov Model Toolkit). Just providing text file and the corresponding speech file, obtained from cloud text-to-speech technology, CGUAlign can create the timed-text file to achieve the synchronization of speech and text. Subsequently, we also build a simple website created with JavaScript. This website can use the timed-text file as CALL(Computer-assisted Language Learning) purposes. Using the website, user can browse the synchronized audiobooks to easily do Shadowing technique. Finally this website also provides dictionary function to achieve the goal of CALL.