A Speech recognition-based telephone auto-attendant
This dissertation details the implementation of a real-time, speaker-independent telephone auto attendant from first principles on limited quality speech data. An auto attendant is a computerized agent that answers the phone and switches the caller through to the desired person's extension afte...
Main Author: | |
---|---|
Other Authors: | |
Published: |
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/2263/29510 Van Leeuwen, GFvB 2001, A speech recognition-based telephone auto-attendant, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29510 > http://upetd.up.ac.za/thesis/available/etd-11172005-090624/ |
id |
ndltd-netd.ac.za-oai-union.ndltd.org-up-oai-repository.up.ac.za-2263-29510 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-netd.ac.za-oai-union.ndltd.org-up-oai-repository.up.ac.za-2263-295102017-07-20T04:11:43Z A Speech recognition-based telephone auto-attendant Van Leeuwen, Gysbert Floris Van Beek Prof E C Botha upetd@up.ac.za Telephone answering services automation Automatic speech recognition UCTD This dissertation details the implementation of a real-time, speaker-independent telephone auto attendant from first principles on limited quality speech data. An auto attendant is a computerized agent that answers the phone and switches the caller through to the desired person's extension after conducting a limited dialogue to determine the wishes of the caller, through the use of speech recognition technology. The platform is a computer with a telephone interface card. The speech recognition engine uses whole word hidden Markov modelling, with limited vocabulary and constrained (finite state) grammar. The feature set used is based on Mel frequency spaced cepstral coefficients. The Viterbi search is used together with the level building algorithm to recognise speech within the utterances. Word-spotting techniques including a "garbage" model, are used. Various techniques compensating for noise and a varying channel transfer function are employed to improve the recognition rate. An Afrikaans conversational interface prompts the caller for information. Detailed experiments illustrate the dependence and sensitivity of the system on its parameters, and show the influence of several techniques aimed at improving the recognition rate. Dissertation (MEng (Computer Engineering))--University of Pretoria, 2006. Electrical, Electronic and Computer Engineering unrestricted 2013-09-07T15:48:53Z 2005-11-21 2013-09-07T15:48:53Z 2002-04-01 2006-11-21 2005-11-17 Dissertation http://hdl.handle.net/2263/29510 Van Leeuwen, GFvB 2001, A speech recognition-based telephone auto-attendant, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29510 > H549/ag http://upetd.up.ac.za/thesis/available/etd-11172005-090624/ © 2001, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. |
collection |
NDLTD |
sources |
NDLTD |
topic |
Telephone answering services automation Automatic speech recognition UCTD |
spellingShingle |
Telephone answering services automation Automatic speech recognition UCTD Van Leeuwen, Gysbert Floris Van Beek A Speech recognition-based telephone auto-attendant |
description |
This dissertation details the implementation of a real-time, speaker-independent telephone auto attendant from first principles on limited quality speech data. An auto attendant is a computerized agent that answers the phone and switches the caller through to the desired person's extension after conducting a limited dialogue to determine the wishes of the caller, through the use of speech recognition technology. The platform is a computer with a telephone interface card. The speech recognition engine uses whole word hidden Markov modelling, with limited vocabulary and constrained (finite state) grammar. The feature set used is based on Mel frequency spaced cepstral coefficients. The Viterbi search is used together with the level building algorithm to recognise speech within the utterances. Word-spotting techniques including a "garbage" model, are used. Various techniques compensating for noise and a varying channel transfer function are employed to improve the recognition rate. An Afrikaans conversational interface prompts the caller for information. Detailed experiments illustrate the dependence and sensitivity of the system on its parameters, and show the influence of several techniques aimed at improving the recognition rate. === Dissertation (MEng (Computer Engineering))--University of Pretoria, 2006. === Electrical, Electronic and Computer Engineering === unrestricted |
author2 |
Prof E C Botha |
author_facet |
Prof E C Botha Van Leeuwen, Gysbert Floris Van Beek |
author |
Van Leeuwen, Gysbert Floris Van Beek |
author_sort |
Van Leeuwen, Gysbert Floris Van Beek |
title |
A Speech recognition-based telephone auto-attendant |
title_short |
A Speech recognition-based telephone auto-attendant |
title_full |
A Speech recognition-based telephone auto-attendant |
title_fullStr |
A Speech recognition-based telephone auto-attendant |
title_full_unstemmed |
A Speech recognition-based telephone auto-attendant |
title_sort |
speech recognition-based telephone auto-attendant |
publishDate |
2013 |
url |
http://hdl.handle.net/2263/29510 Van Leeuwen, GFvB 2001, A speech recognition-based telephone auto-attendant, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29510 > http://upetd.up.ac.za/thesis/available/etd-11172005-090624/ |
work_keys_str_mv |
AT vanleeuwengysbertflorisvanbeek aspeechrecognitionbasedtelephoneautoattendant AT vanleeuwengysbertflorisvanbeek speechrecognitionbasedtelephoneautoattendant |
_version_ |
1718499288270503936 |