Relevance feedback-based optimization of search queries for Patents

In this project, we design a search query optimization system based on the user’s relevance feedback by generating customized query strings for existing patent alerts. Firstly, the Rocchio algorithm is used to generate a search string by analyzing the characteristics of related patents and unrelated...

Full description

Bibliographic Details
Main Author:	Cheng, Sijin
Format:	Others
Language:	English
Published:	Linköpings universitet, Interaktiva och kognitiva system 2019
Subjects:	Patent Search Query Reformulation Recommendation System Matrix Decomposition Text Processing Computer Systems Datorsystem
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-154173

id	ndltd-UPSALLA1-oai-DiVA.org-liu-154173
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-liu-1541732019-02-02T06:10:02ZRelevance feedback-based optimization of search queries for PatentsengCheng, SijinLinköpings universitet, Interaktiva och kognitiva system2019Patent SearchQuery ReformulationRecommendation SystemMatrix DecompositionText ProcessingComputer SystemsDatorsystemIn this project, we design a search query optimization system based on the user’s relevance feedback by generating customized query strings for existing patent alerts. Firstly, the Rocchio algorithm is used to generate a search string by analyzing the characteristics of related patents and unrelated patents. Then the collaborative filtering recommendation algorithm is used to rank the query results, which considering the previous relevance feedback and patent features, instead of only considering the similarity between query and patents as the traditional method. In order to further explore the performance of the optimization system, we design and conduct a series of evaluation experiments regarding TF-IDF as a baseline method. Experiments show that, with the use of generated search strings, the proportion of unrelated patents in search results is significantly reduced over time. In 4 months, the precision of the retrieved results is optimized from 53.5% to 72%. What’s more, the rank performance of the method we proposed is better than the baseline method. In terms of precision, top10 of recommendation algorithm is about 5 percentage points higher than the baseline method, and top20 is about 7.5% higher. It can be concluded that the approach we proposed can effectively optimize patent search results by learning relevance feedback. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-154173application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Patent Search Query Reformulation Recommendation System Matrix Decomposition Text Processing Computer Systems Datorsystem
spellingShingle	Patent Search Query Reformulation Recommendation System Matrix Decomposition Text Processing Computer Systems Datorsystem Cheng, Sijin Relevance feedback-based optimization of search queries for Patents
description	In this project, we design a search query optimization system based on the user’s relevance feedback by generating customized query strings for existing patent alerts. Firstly, the Rocchio algorithm is used to generate a search string by analyzing the characteristics of related patents and unrelated patents. Then the collaborative filtering recommendation algorithm is used to rank the query results, which considering the previous relevance feedback and patent features, instead of only considering the similarity between query and patents as the traditional method. In order to further explore the performance of the optimization system, we design and conduct a series of evaluation experiments regarding TF-IDF as a baseline method. Experiments show that, with the use of generated search strings, the proportion of unrelated patents in search results is significantly reduced over time. In 4 months, the precision of the retrieved results is optimized from 53.5% to 72%. What’s more, the rank performance of the method we proposed is better than the baseline method. In terms of precision, top10 of recommendation algorithm is about 5 percentage points higher than the baseline method, and top20 is about 7.5% higher. It can be concluded that the approach we proposed can effectively optimize patent search results by learning relevance feedback.
author	Cheng, Sijin
author_facet	Cheng, Sijin
author_sort	Cheng, Sijin
title	Relevance feedback-based optimization of search queries for Patents
title_short	Relevance feedback-based optimization of search queries for Patents
title_full	Relevance feedback-based optimization of search queries for Patents
title_fullStr	Relevance feedback-based optimization of search queries for Patents
title_full_unstemmed	Relevance feedback-based optimization of search queries for Patents
title_sort	relevance feedback-based optimization of search queries for patents
publisher	Linköpings universitet, Interaktiva och kognitiva system
publishDate	2019
url	http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-154173
work_keys_str_mv	AT chengsijin relevancefeedbackbasedoptimizationofsearchqueriesforpatents
_version_	1718969926580961280

Relevance feedback-based optimization of search queries for Patents

Similar Items