Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech

Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously dete...

Full description

Bibliographic Details
Main Authors: Mickael Rouvier, Georges Linarès, Benjamin Lecouteux
Format: Article
Language:English
Published: SpringerOpen 2010-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://dx.doi.org/10.1155/2010/326578
id doaj-a8f21253182c4ac88471e7199e30dcf7
record_format Article
spelling doaj-a8f21253182c4ac88471e7199e30dcf72020-11-25T01:40:11ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222010-01-01201010.1155/2010/326578Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous SpeechMickael RouvierGeorges LinarèsBenjamin LecouteuxSpoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously detect the targeted spoken utterances. We propose a two-level architecture for on-the-fly term spotting. The first level performs a fast detection of the speech segments that probably contain the targeted utterance. The second level refines the detection on the selected segments, by using a speech recognizer based on a query-driven decoding algorithm. Experiments are conducted on both broadcast and spontaneous speech corpora. We investigate the impact of the spontaneity level on system performance. Results show that our method remains effective even if the recognition rates are significantly degraded by disfluencies. http://dx.doi.org/10.1155/2010/326578
collection DOAJ
language English
format Article
sources DOAJ
author Mickael Rouvier
Georges Linarès
Benjamin Lecouteux
spellingShingle Mickael Rouvier
Georges Linarès
Benjamin Lecouteux
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
EURASIP Journal on Audio, Speech, and Music Processing
author_facet Mickael Rouvier
Georges Linarès
Benjamin Lecouteux
author_sort Mickael Rouvier
title Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
title_short Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
title_full Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
title_fullStr Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
title_full_unstemmed Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
title_sort query-driven strategy for on-the-fly term spotting in spontaneous speech
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2010-01-01
description Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously detect the targeted spoken utterances. We propose a two-level architecture for on-the-fly term spotting. The first level performs a fast detection of the speech segments that probably contain the targeted utterance. The second level refines the detection on the selected segments, by using a speech recognizer based on a query-driven decoding algorithm. Experiments are conducted on both broadcast and spontaneous speech corpora. We investigate the impact of the spontaneity level on system performance. Results show that our method remains effective even if the recognition rates are significantly degraded by disfluencies.
url http://dx.doi.org/10.1155/2010/326578
work_keys_str_mv AT mickaelrouvier querydrivenstrategyforontheflytermspottinginspontaneousspeech
AT georgeslinaramp232s querydrivenstrategyforontheflytermspottinginspontaneousspeech
AT benjaminlecouteux querydrivenstrategyforontheflytermspottinginspontaneousspeech
_version_ 1725046625734230016