Segmentation and labelling of speech
During the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recor...
Main Author: | |
---|---|
Format: | Doctoral Thesis |
Language: | English |
Published: |
Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon
1993
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977 http://nbn-resolving.de/urn:isbn:82-7119-592-1 |
id |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-977 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-9772013-04-19T20:50:05ZSegmentation and labelling of speechengKvale, KnutNorges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjonFakultet for informasjonsteknologi, matematikk og elektroteknikk1993TalekodingDatamaskinerTeleteknikkDuring the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recording protocols nor the annotation conventions used have been standardised, making assessments of speech technology products across laboratories and languages difficult. The aim of this thesis work is to contribute towards a standardisation of segmentation and labelling of multi-lingual speech corpora. Segmentation is here defined as the process of dividing the speech pressure waveform into directly succeeding discrete parts. These segments are labelled with phoneme symbols. Continuous speech from five different languages; English, Danish, Swedish, Italien, and Norwegian, have been studied with respect to segmentation and labelling. Due to coarticulation effects, exact segmentation of speech as defined above is theoretically impossible, but the segmentation and labelling provides a link between the speech waveform and the phonological labels which is nevertheless essential for both speech research and for the development of speech technology. Thus, this thesis takes a pragmatic approach to the segmentation and labelling of speech and suggests methods to make the annotation process accurate and reliable enough for practical use. Doctoral thesis, monographinfo:eu-repo/semantics/doctoralThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977urn:isbn:82-7119-592-1Dr. ingeniøravhandling, 0809-103X ; 1993:126application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Doctoral Thesis |
sources |
NDLTD |
topic |
Talekoding Datamaskiner Teleteknikk |
spellingShingle |
Talekoding Datamaskiner Teleteknikk Kvale, Knut Segmentation and labelling of speech |
description |
During the last decades, significant research efforts have been aimed at devoloping speech technology products such as speech input and output systems. In order to train and evaluate these systems huge speech databases have been compiled in laboratories all over the world. However, neither the recording protocols nor the annotation conventions used have been standardised, making assessments of speech technology products across laboratories and languages difficult. The aim of this thesis work is to contribute towards a standardisation of segmentation and labelling of multi-lingual speech corpora. Segmentation is here defined as the process of dividing the speech pressure waveform into directly succeeding discrete parts. These segments are labelled with phoneme symbols. Continuous speech from five different languages; English, Danish, Swedish, Italien, and Norwegian, have been studied with respect to segmentation and labelling. Due to coarticulation effects, exact segmentation of speech as defined above is theoretically impossible, but the segmentation and labelling provides a link between the speech waveform and the phonological labels which is nevertheless essential for both speech research and for the development of speech technology. Thus, this thesis takes a pragmatic approach to the segmentation and labelling of speech and suggests methods to make the annotation process accurate and reliable enough for practical use. |
author |
Kvale, Knut |
author_facet |
Kvale, Knut |
author_sort |
Kvale, Knut |
title |
Segmentation and labelling of speech |
title_short |
Segmentation and labelling of speech |
title_full |
Segmentation and labelling of speech |
title_fullStr |
Segmentation and labelling of speech |
title_full_unstemmed |
Segmentation and labelling of speech |
title_sort |
segmentation and labelling of speech |
publisher |
Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon |
publishDate |
1993 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-977 http://nbn-resolving.de/urn:isbn:82-7119-592-1 |
work_keys_str_mv |
AT kvaleknut segmentationandlabellingofspeech |
_version_ |
1716582964940242944 |