Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles

<p>Abstract</p> <p>Background</p> <p>Receiving extraneous articles in response to a query submitted to MEDLINE/PubMed is common. When submitting a multi-word query (which is the majority of queries submitted), the presence of all query words within each article may be a...

Full description

Bibliographic Details
Main Authors: Knaus William A, Shu Jianfen, Siadaty Mir S
Format: Article
Language:English
Published: BMC 2007-01-01
Series:BMC Medical Informatics and Decision Making
Online Access:http://www.biomedcentral.com/1472-6947/7/1
id doaj-e00dcc87f94444539d63caead959c6cb
record_format Article
spelling doaj-e00dcc87f94444539d63caead959c6cb2020-11-24T21:33:41ZengBMCBMC Medical Informatics and Decision Making1472-69472007-01-0171110.1186/1472-6947-7-1Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articlesKnaus William AShu JianfenSiadaty Mir S<p>Abstract</p> <p>Background</p> <p>Receiving extraneous articles in response to a query submitted to MEDLINE/PubMed is common. When submitting a multi-word query (which is the majority of queries submitted), the presence of all query words within each article may be a necessary condition for retrieving relevant articles, but not sufficient. Ideally a relationship between the query words in the article is also required. We propose that if two words occur within an article, the probability that a relation between them is explained is higher when the words occur within adjacent sentences versus remote sentences. Therefore, sentence-level concurrence can be used as a surrogate for existence of the relationship between the words.</p> <p>In order to avoid the irrelevant articles, one solution would be to increase the search specificity. Another solution is to estimate a relevance score to sort the retrieved articles. However among the >30 retrieval services available for MEDLINE, only a few estimate a relevance score, and none detects and incorporates the relation between the query words as part of the relevance score.</p> <p>Results</p> <p>We have developed "Relemed", a search engine for MEDLINE. Relemed increases specificity and precision of retrieval by searching for query words within sentences rather than the whole article. It uses sentence-level concurrence as a statistical surrogate for the existence of relationship between the words. It also estimates a relevance score and sorts the results on this basis, thus shifting irrelevant articles lower down the list.</p> <p>In two case studies, we demonstrate that the most relevant articles appear at the top of the Relemed results, while this is not necessarily the case with a PubMed search. We have also shown that a Relemed search includes not only all the articles retrieved by PubMed, but potentially additional relevant articles, due to the extended 'automatic term mapping' and text-word searching features implemented in Relemed.</p> <p>Conclusion</p> <p>By using sentence-level matching, Relemed can deliver higher specificity, thus eliminating more false-positive articles. By introducing an appropriate relevance metric, the most relevant articles on which the user wishes to focus are listed first. Relemed also shrinks the displayed text, and hence the time spent scanning the articles.</p> http://www.biomedcentral.com/1472-6947/7/1
collection DOAJ
language English
format Article
sources DOAJ
author Knaus William A
Shu Jianfen
Siadaty Mir S
spellingShingle Knaus William A
Shu Jianfen
Siadaty Mir S
Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
BMC Medical Informatics and Decision Making
author_facet Knaus William A
Shu Jianfen
Siadaty Mir S
author_sort Knaus William A
title Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
title_short Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
title_full Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
title_fullStr Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
title_full_unstemmed Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
title_sort relemed: sentence-level search engine with relevance score for the medline database of biomedical articles
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2007-01-01
description <p>Abstract</p> <p>Background</p> <p>Receiving extraneous articles in response to a query submitted to MEDLINE/PubMed is common. When submitting a multi-word query (which is the majority of queries submitted), the presence of all query words within each article may be a necessary condition for retrieving relevant articles, but not sufficient. Ideally a relationship between the query words in the article is also required. We propose that if two words occur within an article, the probability that a relation between them is explained is higher when the words occur within adjacent sentences versus remote sentences. Therefore, sentence-level concurrence can be used as a surrogate for existence of the relationship between the words.</p> <p>In order to avoid the irrelevant articles, one solution would be to increase the search specificity. Another solution is to estimate a relevance score to sort the retrieved articles. However among the >30 retrieval services available for MEDLINE, only a few estimate a relevance score, and none detects and incorporates the relation between the query words as part of the relevance score.</p> <p>Results</p> <p>We have developed "Relemed", a search engine for MEDLINE. Relemed increases specificity and precision of retrieval by searching for query words within sentences rather than the whole article. It uses sentence-level concurrence as a statistical surrogate for the existence of relationship between the words. It also estimates a relevance score and sorts the results on this basis, thus shifting irrelevant articles lower down the list.</p> <p>In two case studies, we demonstrate that the most relevant articles appear at the top of the Relemed results, while this is not necessarily the case with a PubMed search. We have also shown that a Relemed search includes not only all the articles retrieved by PubMed, but potentially additional relevant articles, due to the extended 'automatic term mapping' and text-word searching features implemented in Relemed.</p> <p>Conclusion</p> <p>By using sentence-level matching, Relemed can deliver higher specificity, thus eliminating more false-positive articles. By introducing an appropriate relevance metric, the most relevant articles on which the user wishes to focus are listed first. Relemed also shrinks the displayed text, and hence the time spent scanning the articles.</p>
url http://www.biomedcentral.com/1472-6947/7/1
work_keys_str_mv AT knauswilliama relemedsentencelevelsearchenginewithrelevancescoreforthemedlinedatabaseofbiomedicalarticles
AT shujianfen relemedsentencelevelsearchenginewithrelevancescoreforthemedlinedatabaseofbiomedicalarticles
AT siadatymirs relemedsentencelevelsearchenginewithrelevancescoreforthemedlinedatabaseofbiomedicalarticles
_version_ 1725952507583135744