TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE

Este trabalho apresenta um sistema de síntese de voz a partir de texto irrestrito para a língua portuguesa falada no Brasil. O sistema é baseado na técnica de concatenação, por regras, de unidades de voz previamente codificadas. Propõe-se um inventário de unidades de síntese extremamente reduzi...

Full description

Bibliographic Details
Main Author:	JOSE ALBERTO SOLEWICZ
Other Authors:	ABRAHAM ALCAIM
Language:	Portuguese
Published:	PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO 1993
Online Access:	http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@1 http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@2

id	ndltd-IBICT-oai-MAXWELL.puc-rio.br-8690
record_format	oai_dc
spelling	ndltd-IBICT-oai-MAXWELL.puc-rio.br-86902019-03-01T15:36:03Z TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE SÍNTESE DE VOZ A PARTIR DE TEXTO PARA O PORTUGUÊS DO BRASIL JOSE ALBERTO SOLEWICZ ABRAHAM ALCAIM ABRAHAM ALCAIM FABIO VIOLARO JOAO ANTONIO DE MORAES Este trabalho apresenta um sistema de síntese de voz a partir de texto irrestrito para a língua portuguesa falada no Brasil. O sistema é baseado na técnica de concatenação, por regras, de unidades de voz previamente codificadas. Propõe-se um inventário de unidades de síntese extremamente reduzido (149 unidades) composto, basicamente, por transições consoante-vogal (CV), que representam segmentos acústicos cruciais no processo de produção da fala. Mostrou-se ser possível produzir voz altamente inteligível através da concatenação destas unidades. É proposto, também, o uso de um modelo CELP como estrutura de compressão e síntese do inventário de unidades, incluindo as adaptações necessárias para as alterações prosódicas do sinal no momento de sua codificação. Resultados de testes auditivos mostraram que a síntese através do modelo CELP proposto é superior àquela obtida através do Vocoder-LPC (excitação mono- pulso/ruído) usualmente empregado nos sistemas de síntese de voz a partir de texto. This work presents na unrestricted text-to-speech synthesis system for brazilian portuguese. The system is based on the concatenation by rules of previously coded speech units. An extremely reduced set of synthesis units (149) is proposed. This set is mostly comprised of consonant-vowel (CV) transitions, which represent crucial acoustic segments in the speech production process. Production of highly intelligible speech is show to be possible through concatenation of these units. A CELP model is also proposed as a compression and synthesis structure, which includes necessary adaptations in order to modify the speech prosody during its decoding phase. Subjective tests showed that speech synthesized through the proposed CELP model is judged superior to that obtained through an LPC Vocoder (mono-pulse/noise excited), which is traditionally used in text-to-speech synthesis systems. 1993-08-31 info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/masterThesis http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@1 http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@2 por info:eu-repo/semantics/openAccess PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO PPG EM ENGENHARIA ELÉTRICA PUC-Rio BR reponame:Repositório Institucional da PUC_RIO instname:Pontifícia Universidade Católica do Rio de Janeiro instacron:PUC_RIO
collection	NDLTD
language	Portuguese
sources	NDLTD
description	Este trabalho apresenta um sistema de síntese de voz a partir de texto irrestrito para a língua portuguesa falada no Brasil. O sistema é baseado na técnica de concatenação, por regras, de unidades de voz previamente codificadas. Propõe-se um inventário de unidades de síntese extremamente reduzido (149 unidades) composto, basicamente, por transições consoante-vogal (CV), que representam segmentos acústicos cruciais no processo de produção da fala. Mostrou-se ser possível produzir voz altamente inteligível através da concatenação destas unidades. É proposto, também, o uso de um modelo CELP como estrutura de compressão e síntese do inventário de unidades, incluindo as adaptações necessárias para as alterações prosódicas do sinal no momento de sua codificação. Resultados de testes auditivos mostraram que a síntese através do modelo CELP proposto é superior àquela obtida através do Vocoder-LPC (excitação mono- pulso/ruído) usualmente empregado nos sistemas de síntese de voz a partir de texto. === This work presents na unrestricted text-to-speech synthesis system for brazilian portuguese. The system is based on the concatenation by rules of previously coded speech units. An extremely reduced set of synthesis units (149) is proposed. This set is mostly comprised of consonant-vowel (CV) transitions, which represent crucial acoustic segments in the speech production process. Production of highly intelligible speech is show to be possible through concatenation of these units. A CELP model is also proposed as a compression and synthesis structure, which includes necessary adaptations in order to modify the speech prosody during its decoding phase. Subjective tests showed that speech synthesized through the proposed CELP model is judged superior to that obtained through an LPC Vocoder (mono-pulse/noise excited), which is traditionally used in text-to-speech synthesis systems.
author2	ABRAHAM ALCAIM
author_facet	ABRAHAM ALCAIM JOSE ALBERTO SOLEWICZ
author	JOSE ALBERTO SOLEWICZ
spellingShingle	JOSE ALBERTO SOLEWICZ TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
author_sort	JOSE ALBERTO SOLEWICZ
title	TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
title_short	TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
title_full	TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
title_fullStr	TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
title_full_unstemmed	TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE
title_sort	text-to-speech synthesis for brazilian portuguese
publisher	PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO
publishDate	1993
url	http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@1 http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=8690@2
work_keys_str_mv	AT josealbertosolewicz texttospeechsynthesisforbrazilianportuguese AT josealbertosolewicz sintesedevozapartirdetextoparaoportuguesdobrasil
_version_	1718986761026142208

TEXT-TO-SPEECH SYNTHESIS FOR BRAZILIAN PORTUGUESE

Similar Items