Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment

<p>Abstract</p> <p>Background</p> <p>TreeBASE, the only data repository for phylogenetic studies, is not being used effectively since it does not meet the taxonomic data retrieval requirements of the systematics community. We show, through an examination of the queries...

Full description

Bibliographic Details
Main Authors: Hunt Ela, Anwar Nadia
Format: Article
Language:English
Published: BMC 2009-05-01
Series:BMC Evolutionary Biology
Online Access:http://www.biomedcentral.com/1471-2148/9/93
id doaj-355eba4fa83142eebd4b53b851395bf5
record_format Article
spelling doaj-355eba4fa83142eebd4b53b851395bf52021-09-02T07:29:27ZengBMCBMC Evolutionary Biology1471-21482009-05-01919310.1186/1471-2148-9-93Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichmentHunt ElaAnwar Nadia<p>Abstract</p> <p>Background</p> <p>TreeBASE, the only data repository for phylogenetic studies, is not being used effectively since it does not meet the taxonomic data retrieval requirements of the systematics community. We show, through an examination of the queries performed on TreeBASE, that data retrieval using taxon names is unsatisfactory.</p> <p>Results</p> <p>We report on a new wrapper supporting taxon queries on TreeBASE by utilising a Taxonomy and Classification Database (TCl-Db) we created. TCl-Db holds merged and consolidated taxonomic names from multiple data sources and can be used to translate hierarchical, vernacular and synonym queries into specific query terms in TreeBASE. The query expansion supported by TCl-Db shows very significant information retrieval quality improvement. The wrapper can be accessed at the URL <url>http://spira.zoology.gla.ac.uk/app/tbasewrapper.php</url></p> <p>The methodology we developed is scalable and can be applied to new data, as those become available in the future.</p> <p>Conclusion</p> <p>Significantly improved data retrieval quality is shown for all queries, and additional flexibility is achieved via user-driven taxonomy selection.</p> http://www.biomedcentral.com/1471-2148/9/93
collection DOAJ
language English
format Article
sources DOAJ
author Hunt Ela
Anwar Nadia
spellingShingle Hunt Ela
Anwar Nadia
Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
BMC Evolutionary Biology
author_facet Hunt Ela
Anwar Nadia
author_sort Hunt Ela
title Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
title_short Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
title_full Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
title_fullStr Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
title_full_unstemmed Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment
title_sort improved data retrieval from treebase via taxonomic and linguistic data enrichment
publisher BMC
series BMC Evolutionary Biology
issn 1471-2148
publishDate 2009-05-01
description <p>Abstract</p> <p>Background</p> <p>TreeBASE, the only data repository for phylogenetic studies, is not being used effectively since it does not meet the taxonomic data retrieval requirements of the systematics community. We show, through an examination of the queries performed on TreeBASE, that data retrieval using taxon names is unsatisfactory.</p> <p>Results</p> <p>We report on a new wrapper supporting taxon queries on TreeBASE by utilising a Taxonomy and Classification Database (TCl-Db) we created. TCl-Db holds merged and consolidated taxonomic names from multiple data sources and can be used to translate hierarchical, vernacular and synonym queries into specific query terms in TreeBASE. The query expansion supported by TCl-Db shows very significant information retrieval quality improvement. The wrapper can be accessed at the URL <url>http://spira.zoology.gla.ac.uk/app/tbasewrapper.php</url></p> <p>The methodology we developed is scalable and can be applied to new data, as those become available in the future.</p> <p>Conclusion</p> <p>Significantly improved data retrieval quality is shown for all queries, and additional flexibility is achieved via user-driven taxonomy selection.</p>
url http://www.biomedcentral.com/1471-2148/9/93
work_keys_str_mv AT huntela improveddataretrievalfromtreebaseviataxonomicandlinguisticdataenrichment
AT anwarnadia improveddataretrievalfromtreebaseviataxonomicandlinguisticdataenrichment
_version_ 1721178400098353152