Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription

碩士 === 國立臺灣師範大學 === 資訊工程學系 === 101 === Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for...

Full description

Bibliographic Details
Main Author: 陳憶文
Other Authors: Berlin Chen
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/24695216658836083699
id ndltd-TW-101NTNU5392040
record_format oai_dc
spelling ndltd-TW-101NTNU53920402016-03-18T04:42:07Z http://ndltd.ncl.edu.tw/handle/24695216658836083699 Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription 探索虛擬關聯回饋技術和鄰近資訊於語音文件檢索與辨識之改進 陳憶文 碩士 國立臺灣師範大學 資訊工程學系 101 Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for query expansion. Nevertheless, simply taking all of the top-ranked feedback documents acquired from the initial retrieval for query modeling does not necessary work well, especially when the top-ranked documents contain much redundant or non-relevant cues. In view of this, we explore different kinds of information cues for selecting helpful feedback documents to further improve information retrieval. On the other hand, relevance model (RM) based on “bag-of-words” assumption, which can facilitate the derivation and estimation, may be oversimplified for the task of language modeling in speech recognition. Hence, we also enhance RM in two significant aspects. First, “bag-of-words” assumption of RM is relaxed by incorporating word proximity information into RM formulation. Second, topic-based proximity information is additionally explored to further enhance the proximity-based RM framework. Experiments conducted on not only a spoken document retrieval task but also a speech recognition task indicates that our approaches can bring competitive utilities to existing ones. Berlin Chen 陳柏琳 2013 學位論文 ; thesis 71 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣師範大學 === 資訊工程學系 === 101 === Pseudo-relevance feedback is by far the most commonly-used paradigm for query reformulation in spoken document retrieval, which assumes that a small amount of top-ranked feedback documents obtained from the initial retrieval are relevant and can be utilized for query expansion. Nevertheless, simply taking all of the top-ranked feedback documents acquired from the initial retrieval for query modeling does not necessary work well, especially when the top-ranked documents contain much redundant or non-relevant cues. In view of this, we explore different kinds of information cues for selecting helpful feedback documents to further improve information retrieval. On the other hand, relevance model (RM) based on “bag-of-words” assumption, which can facilitate the derivation and estimation, may be oversimplified for the task of language modeling in speech recognition. Hence, we also enhance RM in two significant aspects. First, “bag-of-words” assumption of RM is relaxed by incorporating word proximity information into RM formulation. Second, topic-based proximity information is additionally explored to further enhance the proximity-based RM framework. Experiments conducted on not only a spoken document retrieval task but also a speech recognition task indicates that our approaches can bring competitive utilities to existing ones.
author2 Berlin Chen
author_facet Berlin Chen
陳憶文
author 陳憶文
spellingShingle 陳憶文
Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
author_sort 陳憶文
title Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_short Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_full Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_fullStr Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_full_unstemmed Exploring Effective Pseudo-Relevance Feedback and Proximity Information for Speech Retrieval and Transcription
title_sort exploring effective pseudo-relevance feedback and proximity information for speech retrieval and transcription
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/24695216658836083699
work_keys_str_mv AT chényìwén exploringeffectivepseudorelevancefeedbackandproximityinformationforspeechretrievalandtranscription
AT chényìwén tànsuǒxūnǐguānliánhuíkuìjìshùhélínjìnzīxùnyúyǔyīnwénjiànjiǎnsuǒyǔbiànshízhīgǎijìn
_version_ 1718208304063184896