Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations

In information retrieval, query expansion methods, such as pseudo-relevance feedback, are designed to enrich users' queries with relevant terms for comprehensively interpreting the desired information. One of the key issues for query expansion is how to obtain high-quality expansion terms to ca...

Full description

Bibliographic Details
Main Authors: Bo Xu, Hongfei Lin, Yuan Lin, Liang Yang, Kan Xu
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8493557/
id doaj-4734465e384a4278ae32f15bea5021f8
record_format Article
spelling doaj-4734465e384a4278ae32f15bea5021f82021-03-29T21:27:03ZengIEEEIEEE Access2169-35362018-01-016621526216510.1109/ACCESS.2018.28764258493557Improving Pseudo-Relevance Feedback With Neural Network-Based Word RepresentationsBo Xu0https://orcid.org/0000-0001-5453-978XHongfei Lin1Yuan Lin2Liang Yang3Kan Xu4Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, ChinaFaculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, ChinaWISE Lab, School of Public Administration and Law, Dalian University of Technology, Dalian, ChinaFaculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, ChinaFaculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, ChinaIn information retrieval, query expansion methods, such as pseudo-relevance feedback, are designed to enrich users' queries with relevant terms for comprehensively interpreting the desired information. One of the key issues for query expansion is how to obtain high-quality expansion terms to capture the information needs. Recent advances in neural network language models have demonstrated that these models can learn powerful distributed word representations, which have been successfully applied to solve various natural language processing tasks. In this paper, we propose a novel query expansion framework based on neural network-based word representations. Our framework first selects abundant candidate expansion terms using a modified term-dependency method and then generates term features for candidate terms based on word representations to encode relationships between given queries and corresponding candidate terms. Furthermore, we adopt learning-to-rank methods to train term-ranking models with the generated features for term refinement. We conduct extensive experiments to examine the performance of the learned term-ranking models and compare the effectiveness of the representation-based and context-based features for selecting relevant expansion terms. Experimental results using four TREC collections show that neural network-based word representations are effective in query expansion and can significantly improve retrieval performance.https://ieeexplore.ieee.org/document/8493557/Information retrievallearning-to-rankpseudo-relevance feedbackword representations
collection DOAJ
language English
format Article
sources DOAJ
author Bo Xu
Hongfei Lin
Yuan Lin
Liang Yang
Kan Xu
spellingShingle Bo Xu
Hongfei Lin
Yuan Lin
Liang Yang
Kan Xu
Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
IEEE Access
Information retrieval
learning-to-rank
pseudo-relevance feedback
word representations
author_facet Bo Xu
Hongfei Lin
Yuan Lin
Liang Yang
Kan Xu
author_sort Bo Xu
title Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
title_short Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
title_full Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
title_fullStr Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
title_full_unstemmed Improving Pseudo-Relevance Feedback With Neural Network-Based Word Representations
title_sort improving pseudo-relevance feedback with neural network-based word representations
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description In information retrieval, query expansion methods, such as pseudo-relevance feedback, are designed to enrich users' queries with relevant terms for comprehensively interpreting the desired information. One of the key issues for query expansion is how to obtain high-quality expansion terms to capture the information needs. Recent advances in neural network language models have demonstrated that these models can learn powerful distributed word representations, which have been successfully applied to solve various natural language processing tasks. In this paper, we propose a novel query expansion framework based on neural network-based word representations. Our framework first selects abundant candidate expansion terms using a modified term-dependency method and then generates term features for candidate terms based on word representations to encode relationships between given queries and corresponding candidate terms. Furthermore, we adopt learning-to-rank methods to train term-ranking models with the generated features for term refinement. We conduct extensive experiments to examine the performance of the learned term-ranking models and compare the effectiveness of the representation-based and context-based features for selecting relevant expansion terms. Experimental results using four TREC collections show that neural network-based word representations are effective in query expansion and can significantly improve retrieval performance.
topic Information retrieval
learning-to-rank
pseudo-relevance feedback
word representations
url https://ieeexplore.ieee.org/document/8493557/
work_keys_str_mv AT boxu improvingpseudorelevancefeedbackwithneuralnetworkbasedwordrepresentations
AT hongfeilin improvingpseudorelevancefeedbackwithneuralnetworkbasedwordrepresentations
AT yuanlin improvingpseudorelevancefeedbackwithneuralnetworkbasedwordrepresentations
AT liangyang improvingpseudorelevancefeedbackwithneuralnetworkbasedwordrepresentations
AT kanxu improvingpseudorelevancefeedbackwithneuralnetworkbasedwordrepresentations
_version_ 1724192864183779328