Domain fusion analysis by applying relational algebra to protein sequence and domain databases

<p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PR...

Full description

Bibliographic Details
Main Authors: Ikura Mitsuhiko, Truong Kevin
Format: Article
Language:English
Published: BMC 2003-05-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/4/16
id doaj-75d61a99436843b0a2d56a1d00b2101e
record_format Article
spelling doaj-75d61a99436843b0a2d56a1d00b2101e2020-11-25T01:47:05ZengBMCBMC Bioinformatics1471-21052003-05-01411610.1186/1471-2105-4-16Domain fusion analysis by applying relational algebra to protein sequence and domain databasesIkura MitsuhikoTruong Kevin<p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful.</p> <p>Results</p> <p>This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in <it>H. sapiens </it>and <it>S. cerevisiae</it>, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at <url>http://calcium.uhnres.utoronto.ca/pi</url>.</p> <p>Conclusion</p> <p>As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.</p> http://www.biomedcentral.com/1471-2105/4/16
collection DOAJ
language English
format Article
sources DOAJ
author Ikura Mitsuhiko
Truong Kevin
spellingShingle Ikura Mitsuhiko
Truong Kevin
Domain fusion analysis by applying relational algebra to protein sequence and domain databases
BMC Bioinformatics
author_facet Ikura Mitsuhiko
Truong Kevin
author_sort Ikura Mitsuhiko
title Domain fusion analysis by applying relational algebra to protein sequence and domain databases
title_short Domain fusion analysis by applying relational algebra to protein sequence and domain databases
title_full Domain fusion analysis by applying relational algebra to protein sequence and domain databases
title_fullStr Domain fusion analysis by applying relational algebra to protein sequence and domain databases
title_full_unstemmed Domain fusion analysis by applying relational algebra to protein sequence and domain databases
title_sort domain fusion analysis by applying relational algebra to protein sequence and domain databases
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2003-05-01
description <p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful.</p> <p>Results</p> <p>This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in <it>H. sapiens </it>and <it>S. cerevisiae</it>, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at <url>http://calcium.uhnres.utoronto.ca/pi</url>.</p> <p>Conclusion</p> <p>As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.</p>
url http://www.biomedcentral.com/1471-2105/4/16
work_keys_str_mv AT ikuramitsuhiko domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases
AT truongkevin domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases
_version_ 1725016479747801088