Segmentation and labelling of speech

During the last decades, significant research efforts have been aimed at developing speech technology products such as speech input and output systems. In order to train and evaluate these systems, huge speech databases have been compiled in laboratories all over the world. However, neither the recor...

Bibliographic Details
Main Author: Kvale, Knut
Format: Doctoral Thesis
Language: English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon 1993
Subjects: Talekoding (speech coding), Datamaskiner (computers), Teleteknikk (telecommunications engineering)
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977
http://nbn-resolving.de/urn:isbn:82-7119-592-1
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-977
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-977
2013-04-19T20:50:05Z
Segmentation and labelling of speech
eng
Kvale, Knut
Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon
Fakultet for informasjonsteknologi, matematikk og elektroteknikk
1993
Talekoding
Datamaskiner
Teleteknikk
Doctoral thesis, monograph
info:eu-repo/semantics/doctoralThesis
text
http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977
urn:isbn:82-7119-592-1
Dr. ingeniøravhandling, 0809-103X ; 1993:126
application/pdf
info:eu-repo/semantics/openAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Talekoding (speech coding)
Datamaskiner (computers)
Teleteknikk (telecommunications engineering)
spellingShingle Talekoding
Datamaskiner
Teleteknikk
Kvale, Knut
Segmentation and labelling of speech
description During the last decades, significant research efforts have been aimed at developing speech technology products such as speech input and output systems. In order to train and evaluate these systems, huge speech databases have been compiled in laboratories all over the world. However, neither the recording protocols nor the annotation conventions used have been standardised, making assessments of speech technology products across laboratories and languages difficult. The aim of this thesis work is to contribute towards a standardisation of segmentation and labelling of multi-lingual speech corpora. Segmentation is here defined as the process of dividing the speech pressure waveform into directly succeeding discrete parts. These segments are labelled with phoneme symbols. Continuous speech from five different languages (English, Danish, Swedish, Italian, and Norwegian) has been studied with respect to segmentation and labelling. Due to coarticulation effects, exact segmentation of speech as defined above is theoretically impossible, but segmentation and labelling provide a link between the speech waveform and the phonological labels, which is nevertheless essential both for speech research and for the development of speech technology. Thus, this thesis takes a pragmatic approach to the segmentation and labelling of speech and suggests methods to make the annotation process accurate and reliable enough for practical use.
author Kvale, Knut
author_facet Kvale, Knut
author_sort Kvale, Knut
title Segmentation and labelling of speech
title_short Segmentation and labelling of speech
title_full Segmentation and labelling of speech
title_fullStr Segmentation and labelling of speech
title_full_unstemmed Segmentation and labelling of speech
title_sort segmentation and labelling of speech
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon
publishDate 1993
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977
http://nbn-resolving.de/urn:isbn:82-7119-592-1
work_keys_str_mv AT kvaleknut segmentationandlabellingofspeech
_version_ 1716582964940242944