Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously dete...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2010-01-01
|
Series: | EURASIP Journal on Audio, Speech, and Music Processing |
Online Access: | http://dx.doi.org/10.1155/2010/326578 |
id |
doaj-a8f21253182c4ac88471e7199e30dcf7 |
---|---|
record_format |
Article |
spelling |
doaj-a8f21253182c4ac88471e7199e30dcf72020-11-25T01:40:11ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222010-01-01201010.1155/2010/326578Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous SpeechMickael RouvierGeorges LinarèsBenjamin LecouteuxSpoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously detect the targeted spoken utterances. We propose a two-level architecture for on-the-fly term spotting. The first level performs a fast detection of the speech segments that probably contain the targeted utterance. The second level refines the detection on the selected segments, by using a speech recognizer based on a query-driven decoding algorithm. Experiments are conducted on both broadcast and spontaneous speech corpora. We investigate the impact of the spontaneity level on system performance. Results show that our method remains effective even if the recognition rates are significantly degraded by disfluencies. http://dx.doi.org/10.1155/2010/326578 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mickael Rouvier Georges Linarès Benjamin Lecouteux |
spellingShingle |
Mickael Rouvier Georges Linarès Benjamin Lecouteux Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech EURASIP Journal on Audio, Speech, and Music Processing |
author_facet |
Mickael Rouvier Georges Linarès Benjamin Lecouteux |
author_sort |
Mickael Rouvier |
title |
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech |
title_short |
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech |
title_full |
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech |
title_fullStr |
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech |
title_full_unstemmed |
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech |
title_sort |
query-driven strategy for on-the-fly term spotting in spontaneous speech |
publisher |
SpringerOpen |
series |
EURASIP Journal on Audio, Speech, and Music Processing |
issn |
1687-4714 1687-4722 |
publishDate |
2010-01-01 |
description |
Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously detect the targeted spoken utterances. We propose a two-level architecture for on-the-fly term spotting. The first level performs a fast detection of the speech segments that probably contain the targeted utterance. The second level refines the detection on the selected segments, by using a speech recognizer based on a query-driven decoding algorithm. Experiments are conducted on both broadcast and spontaneous speech corpora. We investigate the impact of the spontaneity level on system performance. Results show that our method remains effective even if the recognition rates are significantly degraded by disfluencies. |
url |
http://dx.doi.org/10.1155/2010/326578 |
work_keys_str_mv |
AT mickaelrouvier querydrivenstrategyforontheflytermspottinginspontaneousspeech AT georgeslinaramp232s querydrivenstrategyforontheflytermspottinginspontaneousspeech AT benjaminlecouteux querydrivenstrategyforontheflytermspottinginspontaneousspeech |
_version_ |
1725046625734230016 |