Segmentation and labelling of speech

During the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recor...

Full description

Bibliographic Details
Main Author:	Kvale, Knut
Format:	Doctoral Thesis
Language:	English
Published:	Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon 1993
Subjects:	Talekoding Datamaskiner Teleteknikk
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977 http://nbn-resolving.de/urn:isbn:82-7119-592-1

id	ndltd-UPSALLA1-oai-DiVA.org-ntnu-977
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-ntnu-9772013-04-19T20:50:05ZSegmentation and labelling of speechengKvale, KnutNorges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjonFakultet for informasjonsteknologi, matematikk og elektroteknikk1993TalekodingDatamaskinerTeleteknikkDuring the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recording protocols nor the annotation conventions used have been standardised, making assessments of speech technology products across laboratories and languages difficult. The aim of this thesis work is to contribute towards a standardisation of segmentation and labelling of multi-lingual speech corpora. Segmentation is here defined as the process of dividing the speech pressure waveform into directly succeeding discrete parts. These segments are labelled with phoneme symbols. Continuous speech from five different languages; English, Danish, Swedish, Italien, and Norwegian, have been studied with respect to segmentation and labelling. Due to coarticulation effects, exact segmentation of speech as defined above is theoretically impossible, but the segmentation and labelling provides a link between the speech waveform and the phonological labels which is nevertheless essential for both speech research and for the development of speech technology. Thus, this thesis takes a pragmatic approach to the segmentation and labelling of speech and suggests methods to make the annotation process accurate and reliable enough for practical use. Doctoral thesis, monographinfo:eu-repo/semantics/doctoralThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977urn:isbn:82-7119-592-1Dr. ingeniøravhandling, 0809-103X ; 1993:126application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Doctoral Thesis
sources	NDLTD
topic	Talekoding Datamaskiner Teleteknikk
spellingShingle	Talekoding Datamaskiner Teleteknikk Kvale, Knut Segmentation and labelling of speech
description	During the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recording protocols nor the annotation conventions used have been standardised, making assessments of speech technology products across laboratories and languages difficult. The aim of this thesis work is to contribute towards a standardisation of segmentation and labelling of multi-lingual speech corpora. Segmentation is here defined as the process of dividing the speech pressure waveform into directly succeeding discrete parts. These segments are labelled with phoneme symbols. Continuous speech from five different languages; English, Danish, Swedish, Italien, and Norwegian, have been studied with respect to segmentation and labelling. Due to coarticulation effects, exact segmentation of speech as defined above is theoretically impossible, but the segmentation and labelling provides a link between the speech waveform and the phonological labels which is nevertheless essential for both speech research and for the development of speech technology. Thus, this thesis takes a pragmatic approach to the segmentation and labelling of speech and suggests methods to make the annotation process accurate and reliable enough for practical use.
author	Kvale, Knut
author_facet	Kvale, Knut
author_sort	Kvale, Knut
title	Segmentation and labelling of speech
title_short	Segmentation and labelling of speech
title_full	Segmentation and labelling of speech
title_fullStr	Segmentation and labelling of speech
title_full_unstemmed	Segmentation and labelling of speech
title_sort	segmentation and labelling of speech
publisher	Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon
publishDate	1993
url	http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977 http://nbn-resolving.de/urn:isbn:82-7119-592-1
work_keys_str_mv	AT kvaleknut segmentationandlabellingofspeech
_version_	1716582964940242944

Segmentation and labelling of speech

Similar Items