Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription

碩士 === 國立臺灣師範大學 === 資訊工程學系 === 101 === Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for...

Full description

Bibliographic Details
Main Author:	陳憶文
Other Authors:	Berlin Chen
Format:	Others
Language:	en_US
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/24695216658836083699

id	ndltd-TW-101NTNU5392040
record_format	oai_dc
spelling	ndltd-TW-101NTNU53920402016-03-18T04:42:07Z http://ndltd.ncl.edu.tw/handle/24695216658836083699 Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription 探索虛擬關聯回饋技術和鄰近資訊於語音文件檢索與辨識之改進陳憶文碩士國立臺灣師範大學資訊工程學系 101 Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for query expansion. Nevertheless, simply taking all of the top-ranked feedback documents acquired from the initial retrieval for query modeling does not necessary work well, especially when the top-ranked documents contain much redundant or non-relevant cues. In view of this, we explore different kinds of information cues for selecting helpful feedback documents to further improve information retrieval. On the other hand, relevance model (RM) based on “bag-of-words” assumption, which can facilitate the derivation and estimation, may be oversimplified for the task of language modeling in speech recognition. Hence, we also enhance RM in two significant aspects. First, “bag-of-words” assumption of RM is relaxed by incorporating word proximity information into RM formulation. Second, topic-based proximity information is additionally explored to further enhance the proximity-based RM framework. Experiments conducted on not only a spoken document retrieval task but also a speech recognition task indicates that our approaches can bring competitive utilities to existing ones. Berlin Chen 陳柏琳 2013 學位論文 ; thesis 71 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣師範大學 === 資訊工程學系 === 101 === Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for query expansion. Nevertheless, simply taking all of the top-ranked feedback documents acquired from the initial retrieval for query modeling does not necessary work well, especially when the top-ranked documents contain much redundant or non-relevant cues. In view of this, we explore different kinds of information cues for selecting helpful feedback documents to further improve information retrieval. On the other hand, relevance model (RM) based on “bag-of-words” assumption, which can facilitate the derivation and estimation, may be oversimplified for the task of language modeling in speech recognition. Hence, we also enhance RM in two significant aspects. First, “bag-of-words” assumption of RM is relaxed by incorporating word proximity information into RM formulation. Second, topic-based proximity information is additionally explored to further enhance the proximity-based RM framework. Experiments conducted on not only a spoken document retrieval task but also a speech recognition task indicates that our approaches can bring competitive utilities to existing ones.
author2	Berlin Chen
author_facet	Berlin Chen 陳憶文
author	陳憶文
spellingShingle	陳憶文 Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
author_sort	陳憶文
title	Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_short	Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_full	Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_fullStr	Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_full_unstemmed	Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_sort	exploring effective pseudo-relevance feedback and proximity information for speech retrieval and transcription
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/24695216658836083699
work_keys_str_mv	AT chényìwén exploringeffectivepseudorelevancefeedbackandproximityinformationforspeechretrievalandtranscription AT chényìwén tànsuǒxūnǐguānliánhuíkuìjìshùhélínjìnzīxùnyúyǔyīnwénjiànjiǎnsuǒyǔbiànshízhīgǎijìn
_version_	1718208304063184896

Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription

Similar Items