Unit selection for Malay text-to-speech system using segmental context and simulated annealing

Unit selection method has become the main approach in speech synthesis. The increasing size of recorded speech has resulted in better synthesis speech quality but at the same time also resulted in more expensive computational effort. Therefore, this paper proposes a combination of segmental context...

Full description

Bibliographic Details
Main Authors: Tan, Tian Swee (Author), Shaikh Salleh, Sheikh Hussain (Author), Zainuddin, Zaitul Marlizawati (Author), Lim, Yee Chea (Author), Hum, Yan Chai (Author)
Format: Article
Language:English
Published: Academic Journals, 2011-09-19.
Subjects:
Online Access:Get fulltext
LEADER 01699 am a22001813u 4500
001 29907
042 |a dc 
100 1 0 |a Tan, Tian Swee  |e author 
700 1 0 |a Shaikh Salleh, Sheikh Hussain  |e author 
700 1 0 |a Zainuddin, Zaitul Marlizawati  |e author 
700 1 0 |a Lim, Yee Chea  |e author 
700 1 0 |a Hum, Yan Chai  |e author 
245 0 0 |a Unit selection for Malay text-to-speech system using segmental context and simulated annealing 
260 |b Academic Journals,   |c 2011-09-19. 
856 |z Get fulltext  |u http://eprints.utm.my/id/eprint/29907/1/TanTianSwee2011_UnitSelectionforMalayTexttospeechSystem.pdf 
520 |a Unit selection method has become the main approach in speech synthesis. The increasing size of recorded speech has resulted in better synthesis speech quality but at the same time also resulted in more expensive computational effort. Therefore, this paper proposes a combination of segmental context matching procedure and Simulated Annealing (SA) in unit selection to improve the quality of synthetic speech and reduce the computational time. The process of unit selection is based on minimization of two costs: target cost and join cost. The segmental context (target cost), the first stage of unit selection matching procedure used to narrow down the search space, followed by an optimization method which is SA to find the units sequence with minimum join cost. Result shows that the synthesis words produced by the proposed system are 15.48% better compared to previous version of corpus-based Malay Text-to-Speech system. Future works may focus on combining SA with other heuristic methods to further enhancing the performance of unit selection. 
546 |a en 
650 0 4 |a Q Science (General)